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PREFACE 


This is a book on practical feedback control and not on system theory 
generally. Feedback is used in control systems for two reasons: First, to 
change the dynamics of the system — usually, to make the response stable and 
sufficiently fast. Second, to reduce the sensitivity of the system to uncertainty 
— both signal uncertainty (disturbances) and model uncertainty. Important 
topics covered in the book, include 


e classical frequency-domain methods 

analysis of directions in multivariable systems using singular value 
decomposition 

input-output controllability (inherent control limitations in the plant) 
model uncertainty and robustness 

performance requirements 

methods for controller design and model reduction 

control structure selection and decentralized control 


The treatment is limited to linear systems, mainly because the theory is 
then much simpler and more well developed, but also because a large amount 
of practical experience tells us that in many cases linear controllers designed 
using linear methods provide satisfactory performance when applied to real 
nonlinear plants. 

We have attempted to keep the mathematics at a reasonably simple level, 
and we have tried to emphasize results that enhance insight and intuition. The 
design methods currently available for linear systems are well developed, and 
with associated software it is relatively straightforward to design controllers 
for most multivariable plants. However, without insight and intuition it is 
difficult to judge a solution, and to know how to proceed (e.g. how to change 
weights) in order to improve a design. 

The book is appropriate for use as a text for an introductory graduate 
course in multivariable control or for an advanced undergraduate course. We 
also think it will be useful for engineers who want to understand multivariable 
control, its limitations, and how can it be applied in practice. There are 
numerous worked examples, exercises and case studies which make frequent 
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use of MATLAB. 

The prerequisites for reading this book are an introductory course in 
classical single-input single-output (SISO) control and some elementary 
knowledge of matrices and linear algebra. It would be an advantage to have 
some knowledge of system theory or multivariable control, including state- 
space realization theory and optimal control. 

The book is partly based on a graduate multivariable control course given 
by the first author in the Control (Cybernetics) Department at the Norwegian 
University of Science and Technology in Trondheim. About 10 students from 
Electrical, Chemical and Mechanical Engineering have taken the course each 
year since 1989. The course has usually consisted of 3 lectures a week for 12 
weeks. In addition to regular assignments, the students have been required to 
complete a 50 hour design project using MATLAB. In Appendix B, a project 
outline is given together with a sample exam problem. 

By covering parts of the book it should be suitable as a basis for a number of 
linear control courses on various levels: introduction to multivariable control, 
advanced multivariable control, robust control, control system design, and 
control structure design and controllability analysis. 


Examples and ftp 


Most of the numerical examples have been solved using MATLAB. Some 
sample files are included in the text to illustrate the steps involved. Most 
of these files use the p-toolbox, and some the Robust Control toolbox, but in 
most cases the problems could have been solved easily using other software 
packages. 

The following are available over the Internet from Trondheim and Leicester, 


MATLAB files for examples and figures 

Solutions to selected exercises 

Linear state-space models for plants used in the case studies 
Corrections, comments to chapters, extra exercises 


The addresses are: 


Anonymous ftp to ftp.kjemi.unit.no: cd /pub/che/Control.Group/Skogestad 
Web service: http://www.kjemi.unit .no/pub/che/Control.Group/Skogestad/b 


After 1 Sept. 1996: Replace unit by ntnu. 
Comments and questions. Please send questions, errors and any 
comments you may have by email to: 


Sigurd.Skogestad@kjemi.unit.no 


ixp@le.ac.uk 
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INTRODUCTION 


In this chapter, we begin with a brief outline of the design process for control systems. 
We then discuss linear models and transfer functions which are the basic building 
blocks for the analysis and design techniques presented in this book. The scaling of 
variables is critical in applications and so we provide a simple procedure for this. An 
example is given to show how to derive a linear model in terms of deviation variables 
for a practical application. Finally, we summarize the most important notation used 
in the book. 


1.1 The process of control system design 


The process of designing a control system usually makes many demands of 
the engineer or engineering team. These demands often emerge in a step by 
step design procedure as follows: 


1. Study the system (plant) to be controlled and obtain initial information 
about the control objectives. 

2. Model the system and simplify the model, if necessary. 

3. Analyze the resulting model; determine its properties. 

4. Decide which variables are to be controlled (controlled outputs). 

5. Decide on the measurements and manipulated variables: what sensors and 
actuators will be used and where will they be placed? 

6. Select the control configuration. 

7. Decide on the type of controller to be used. 

8. Decide on performance specifications, based on the overall control 
objectives. 

9. Design a controller. 

10. Analyze the resulting controlled system to see if the specifications are 
satisfied; and if they are not satisfied modify the specifications or the type 
of controller. 

11. Simulate the resulting controlled system, either on a computer or a pilot 
plant. 
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12. Repeat from step 2, if necessary. 

13. Choose hardware and software and implement the controller. 

14. Test and validate the control system, and tune the controller on-line, if 
necessary. 


Control courses and text books usually focus on steps 9 and 10 in the 
above procedure; that is, on methods for controller design and control system 
analysis. Interestingly, many real control systems are designed without any 
consideration of these two steps. For example, even for complex systems with 
many inputs and outputs, it may be possible to design workable control 
systems, often based on a hierarchy of cascaded control loops, using only 
on-line tuning (involving steps 1, 4 5, 6, 7, 13 and 14). However, in this case 
a suitable control structure may not be known at the outset, and there is a 
need for systematic tools and insights to assist the designer with steps 4, 5 
and 6. A special feature of this book is the provision of tools for input-output 
controllability analysis (step 3) and for control structure design (steps 4, 5, 6 
and 7). 

Input-output controllability is affected by the location of sensors and 
actuators, but otherwise it cannot be changed by the control engineer. 
Simply stated, “even the best control system cannot make a Ferrari out of a 
Volkswagen”. Therefore, the process of control system design should in some 
cases also include a step 0, involving the design of the process equipment itself. 
The idea of looking at process equipment design and control system design 
as an integrated whole is not new as is clear from the following quote taken 
from a paper by Ziegler and Nichols (1943): 


In the application of automatic controllers, it is important to 
realize that controller and process form a unit; credit or discredit 
for results obtained are attributable to one as much as the 
other. A poor controller is often able to perform acceptably on 
a process which is easily controlled. The finest controller made, 
when applied to a miserably designed process, may not deliver the 
desired performance. True, on badly designed processes, advanced 
controllers are able to eke out better results than older models, 
but on these processes, there is a definite end point which can be 
approached by instrumentation and it falls short of perfection. 


Ziegler and Nichols then proceed to observe that there is a factor in equipment 
design that is neglected, and state that 


...the missing characteristic can be called the “controllability” , 
the ability of the process to achieve and maintain the desired 
equilibrium value. 


To derive simple tools with which to quantify the inherent input-output 
controllability of a plant is the goal of Chapters 5 and 6. 
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1.2 The control problem 


The objective of a control system is to make the output y behave in a 
desired way by manipulating the plant input u. The regulator problem is to 
manipulate u to counteract the effect of a disturbance d. The servo problem is 
to manipulate u to keep the output close to a given reference input r. Thus, in 
both cases we want the control error e = y—r to be small. The algorithm for 
adjusting u based on the available information is the controller K. To arrive 
at a good design for K we need a priori information about the expected d 
and r, and of the plant model (G) and disturbance model (G4). In this book 
we make use of linear models of the form 


y =Gu+Gad (1.1) 


A major source of difficulty is that the models (G, Ga) may be inaccurate or 
may change with time. In particular, inaccuracy in G may cause problems 
because the plant will be part of a feedback loop. To deal with such a 
problem we will make use of the concept of model uncertainty. For example, 
instead of a single model G we may study the behaviour of a class of models, 
G, = G+E, where the “uncertainty” or “perturbation” E is bounded, 
but otherwise unknown. In most cases weighting functions, w(s), are used 
to express F = wA in terms of normalized perturbations, A, where the 
magnitude (norm) of A is less than 1. The following terms are useful: 


Nominal stability (NS). The system is stable with no model uncertainty. 


Nominal Performance (NP). The system satisfies the performance spec- 
ifications with no model uncertainty. 


Robust stability (RS). The system is stable for all perturbed plants about 
the nominal model up to the worst-case model uncertainty. 


Robust performance (RP). The system satisfies the performance speci- 
fications for all perturbed plants about the nominal model up to the 
worst-case model uncertainty. 


We will discuss these terms in detail in Chapters 7 and 8. 


1.3. Transfer functions 


The book makes extensive use of transfer functions, G(s), and of the frequency 
domain, which are very useful in applications for the following reasons: 


e Invaluable insights are obtained from simple frequency-dependent plots. 
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e Important concepts for feedback such as bandwidth and peaks of closed- 
loop transfer functions may be defined. 

e G(jw) gives the steady-state response to a sinusoidal input of frequency w. 

e A series interconnection of systems corresponds in the frequency domain to 
multiplication of the individual system transfer functions, whereas in the 
time domain the evaluation of complicated convolution integrals is required. 

e Poles and zeros appear explicitly in factorized scalar transfer functions. 

e Uncertainty is more easily handled in the frequency domain. This is related 
to the fact that two systems can be described as close (i.e. have similar 
behaviour) if their frequency responses are similar. On the other hand, a 
small change in a parameter in a state-space description can result in an 
entirely different system response. 


We consider linear, time-invariant systems whose input-output responses 
are governed by linear ordinary differential equations with constant 
coefficients. An example of such a system is 


£41 (t) => ayry, (t) vt (t) } by u(t) 
Lo (t) = —a0X1 (t) he bgu(t) (1.2) 
y(t) = a(t) 


where «(t) = dx/dt. Here u(t) represents the input signal, x,(t) and 2x2(t) 
the states, and y(t) the output signal. The system is time-invariant since the 
coefficients a1,@2,b; and bz are independent of time. If we apply the Laplace 
transform to (1.2) we obtain indexLaplace transform 


s%(s)—a,(t=0) = —a,%)(s) + £o(s) + by ti(s) 
8Ho(s)—ao(t=0) = —ao%1(s) + boti(s) (1.3) 
y(s) = #1(s) 


where y(s) denotes the Laplace transform of y(t), and so on. To simplify our 
presentation we will make the usual abuse of notation and replace y(s) by 
y(s), etc.. In addition, we will omit the independent variables s and t when 
the meaning is clear. 

If u(t),a1(t),v2(t) and y(t) represent deviation variables away from a 
nominal operating point or trajectory, then we can assume 21 (¢ = 0) = a(t = 
0) =0. The elimination of 71 (s) and r(s) from (1.3) then yields the transfer 
function 

y(s) (s) =: bys + bo 
~~ §2 +418 +a 
Importantly, for linear systems, the transfer function is independent of the 
input signal (forcing function). Notice that the transfer function in (1.4) may 
also represent the following system 


(1.4) 


Gt) + ary(t) + ary(t) = biu(t) + bou(t) (1.5) 
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with input u(t) and output y(t). 

Transfer functions, such as G(s) in (1.4), will be used throughout the book 
to model systems and their components. More generally, we consider rational 
transfer functions of the form 


Bn 8"* +++++ Bis + Bo 
8s” + ayn_18"-1 + ---+a15 +40 


G(s) = 


(1.6) 


For multivariable systems, G(s) is a matrix of transfer functions. In (1.6) n is 
the order of the denominator or pole polynomial and is also called the order 
of the system, and n, is the order of the numerator or zero polynomial. Then 
nm — nz, is referred to as the pole excess or relative order. 

For a proper system, with n > nz, we may realize (1.6) by a state-space 
description, ¢ = Ar+Bu, y = Cx+Du, similar to (1.2). The transfer function 
may then be written as G(s) = C(sI — A)-'B + D, similar to (1.8). 


Definition 1.1 


s§ 
s 
s§ 
s§ 


A system G 
A system G 
A system G 
A system G 


is strictly proper if G(s) + 0 as s > oo. 

is semi-proper or bi-proper if G(s) > D £0 as s > &w. 
which is strictly proper or semi-proper is proper. 

is improper if G(s) 4 co as s > oo. 


e@oeee 
NN Nm 
Sp SN 


All practical systems will have zero gain at a sufficiently high frequency, 
and are therefore strictly proper. It is often convenient, however, to model 
high frequency effects by a non-zero D-term, and hence semi-proper models 
are frequently used. Furthermore, certain derived transfer functions, such as 
S = (I1+GK) 1, are semi-proper. 

Usually we use G(s) to represent the effect of the inputs u on the outputs 
y, whereas G(s) represents the effect on y of the disturbances d. We then 
have the following linear process model in terms of deviation variables 


y(s) = G(s)u(s) + Ga(s)d(s) (1.7) 


Here we have made use of the superposition principle for linear systems, which 
implies that a change in a dependent variable (y) can simply be found by 
adding together the separate effects resulting from changes in the independent 
variables (u and d) considered one at a time. 

All the signals u(s), d(s) and y(s) are deviation variables. This is sometimes 
shown explicitly, for example, by use of the notation du(s), but since we always 
use deviation variables when we consider Laplace transforms, the 6 is normally 
omitted. 
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1.4 Scaling 


Scaling is very important in practical applications as it makes model analysis 
and controller design (weight selection) much simpler. It requires the engineer 
to make a judgement at the start of the design process about the required 
performance of the system. To do this, decisions are made on the expected 
magnitudes of disturbances and reference changes, on the allowed magnitude 
of each input signal, and on the allowed deviation of each output. 

Let the unscaled (or originally scaled) linear model of the process in 
deviation variables be 


g=Gi+Gad; @=9-7 (1.8) 


where a hat (~ ) is used to show that the variables are in their unscaled 
(original) units. A useful approach for scaling is to make the variables less 
than one in magnitude. This is done by dividing each variable by its maximum 
expected or allowed change. For disturbances and manipulated inputs, we use 
the scaled variables 


d=d/dmax, U=%/fimax (1.9) 
where 
e donax — largest expected change in disturbance 
© imax — largest allowed input change 


The maximum deviation from a nominal value should be chosen by thinking 
of the maximum value one can expect, or allow, as a function of time. 

The variables 7, € and 7 are in the same units, so the same scaling factor 
should be applied to each. Two alternatives are possible: 


© max — largest allowed control error 
© fmax — largest expected change in reference value 


Since a major objective of control is to minimize the control error €, we here 
usually choose to scale with respect to the maximum control error: 


y=G/emax, T=T/Emax, € = €/€max (1.10) 


To formalize the scaling procedure, introduce the scaling factors 


“A 


D_, = Emax, Du = Umax, Da = dmax:, D, — Tmax (1.11) 


For MIMO systems each variable in the vectors d,?,@ and @ may have a 
different maximum value, in which case D., Dy, Dg and D, become diagonal 
scaling matrices. This ensures, for example, that all errors (outputs) are of 
about equal importance in terms of their magnitude. 
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The corresponding scaled variables to use for control purposes are then 
d=D71d, u= D714, y= Dz'f, e= D7, r = DF (1.12) 


€ 


On substituting (1.12) into (1.8) we get 
Dey = GDyu + GaDad; D.e = Dey — Der 
and introducing the scaled transfer functions 
GSP GD: “Cre Ds Gi, (1.13) 
then yields the following model in terms of scaled variables 
y=Gu+Gad; e=y-r (1.14) 


Here u and d should be less than 1 in magnitude, and it is useful in some cases 
to introduce a scaled reference 7, which is less than 1 in magnitude. This is 
done by dividing the reference by the maximum expected reference change 


F =F /Pmax = Dp‘? (1.15) 
We then have that 


r= RF where R2D;!D, =Finax/@max (1.16) 


Here R is the largest expected change in reference relative to the allowed 


d 


RV 


Figure 1.1: Model in terms of scaled variables. 


control error; typically, R > 1. The block diagram for the system in scaled 
variables may then be written as in Figure 1.1, for which the following control 
objective is relevant: 


e In terms of scaled variables we have that |d(t)| 
our control objective is to design u with |u(t)| 
ly(t) — r(t)| <1 (at least most of the time). 


1 and |r(t)| < 1, and 


< 
< 1 such that |e(t)| = 
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Remark 1 A number of the interpretations used in the book depend critically 
on a correct scaling. In particular, this applies to the input-output controllability 
analysis presented in Chapters 5 and 6. Furthermore, for a MIMO system one cannot 
correctly make use of the sensitivity function S = (J +GK)~' unless the output 
errors are of comparable magnitude. 


Remark 2 With the above scalings, the worst-case behaviour of a system is 
analyzed by considering disturbances d of magnitude 1, and references 7 of 
magnitude 1. 


Remark 3 With r = Rr the control error may be written as 


e=y—7r=Gut+Gad — Rr (1.17) 


We see that a normalized reference change 7 may be viewed as a special case of a 
disturbance with Gg = —R, where R is usually a constant diagonal matrix. We will 
sometimes use this to unify our treatment of disturbances and references. 


Remark 4 The above scaling of the outputs is used when analyzing a given plant. 
However, if the issue is to select which outputs to control, see Section 10.3, then it 
one may choose to scale the outputs with respect to their expected variation (which 
is similar to Tmax). 


Remark 5 If the expected or allowed variation of a variable about 0 (its nominal 
value) is not symmetric then the largest variation should be used for ice and the 
smallest variation for Umax and €max. For example, if the disturbance is —5 < a< 10 
then dmax = 10, and if the manipulated input is —5 < u < 10 then Umax = 5. This 
approach may be conservative (in terms of allowing too large disturbances etc.) 
when the variations in several variables are not symmetric. 


A further discussion on scaling and performance is given in Chapter 5 on 
page 172. 


1.5 Deriving linear models 


Linear models may be obtained from physical “first-principle” models, from 
analyzing input-output data, or from a combination of these two approaches. 
Although modelling and system identification are not covered in this book, it 
is always important for a control engineer to have a good understanding of a 
model’s origin. The following steps are usually taken when deriving a linear 
model for controller design based on a first-principle approach: 


1. Formulate a nonlinear state-space model based on physical knowledge. 
2. Determine the steady-state operating point (or trajectory) about which to 
linearize. 
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3. Introduce deviation variables and linearize the model. There are essentially 
three parts to this step: 


(a) Linearize the equations using a Taylor expansion where second and 
higher order terms are omitted. 
(b) Introduce the deviation variables by substitution, e.g., d2(t) defined by 


a(t) = «* + 6x(t) 


where the superscript * denotes the steady-state operating point or 
trajectory along which we are linearizing. 

(c) Subtract the steady-state to eliminate the terms involving only steady- 
state quantities. 


These parts are usually accomplished together. For example, for a nonlinear 
state-space model of the form 


dx 
a= f(z,u) (1.18) 


the linearized model in deviation variables (62, du) is 


“ar = (ae) + Ge) (1.19) 


Ves Se 
B 


Here x and u may be vectors, in which case the Jacobians A and B are 
matrices. 

4. Scale the variables to obtain scaled models which are more suitable for 
control purposes. 


In most cases steps 2 and 3 are performed numerically based on the model 
obtained in step 1. Also, since (1.19) is in terms of deviation variables, its 
Laplace transform becomes sdéx(s) = Adx(s) + Bou(s), or 


dx(s) = (sI — A)~' Béou(s) (1.20) 


Example 1.1 Physical model of a room heating process. 

The above steps for deriving a linear model will be illustrated on the simple example 
depicted in Figure 1.2, where the control problem is to adjust the heat input Q to 
maintain constant room temperature T. The outdoor temperature T, is the main 
disturbance. Units are shown in square brackets. 

1. Physical model. An energy balance for the room requires that the change 
in energy in the room must equal the net inflow of energy to the room (per unit of 
time). This yields the following state-space model 


d 
acy) =Q+a(T, — T) (1.21) 
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To[K] 


Q|W] 


Figure 1.2: Room heating process. 


where T [K] is the room temperature, Cy [J/K] is the heat capacity of the room, Q 
[W] is the heat input (from some heat source), and the term a(To—T) [W] represents 
the net heat loss due to exchange of air and heat conduction through the walls. 

2. Operating point. Consider a case where the heat input Q* is 2000 W and the 
difference between indoor and outdoor temperatures T* —T5 is 20K. Then the steady- 
state energy balance yields a* = 2000/20 = 100 [W/K]. We assume the room heat 
capacity is constant, Cy = 100 /kJ/K]. (This value corresponds approximately to the 
heat capacity of air in a room of about 100m?; thus we neglect heat accumulation in 
the walls.) 

3. Linear model in deviation variables. If we assume a is constant the model 
in (1.21) is already linear. Then introducing deviation variables 


éT(t) = T(t) — T(t), 6Q(t) = Q(t) — Q*(t), 6To(t) = To(t) — To (t) 
yields 
Cy < srt) = §Q(t) + o(8T,(t) — ST(t)) (1.22) 
Remark. /f a depended on the state variable (T in this example), or on one of the 


independent variables of interest (Q or Tp in this example), then one would have to 
include an extra term (T* — Tj )éa(t) on the right hand side of Equation (1.22). 


On taking Laplace transforms in (1.22), assuming 6T(t) = 0 at t = 0, and 
rearranging we get 


1 
Ts+1 


_ov 


6T(s) = = 


1 
(601s) 6To(s)) _. (1.23) 
The time constant for this example is t = 100-107 /100 = 1000 [s] ~ 17 [min] which 
is reasonable. It means that for a step increase in heat input it will take about 17min 


for the temperature to reach 63% of its steady-state increase. 
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4. Linear model in scaled variables. Introduce the following scaled variables 


(8) = u(s) = 320), as) = ge) (1.24) 


In our case the acceptable variations in room temperature T are +1K, i.e. 6Tmax = 
6€max = 1 [K]. Furthermore, the heat input can vary between OW and 6000W, and 
since its nominal value is 2000W we have 6Qmax = 2000 [W] (see Remark5 on 8). 
Finally, the expected variations in outdoor temperature are 10K, i.e. 6To,max = 10 
[K]. The model in terms of scaled variables then becomes 


ee a) 
Ge) 75416Tmax @ 1000841 
1: 60) fe 10 
Ga(s) 73841 6Tmax 1000841 (2) 


Note that the static gain for the input is k = 20, whereas the static gain for the 
disturbance is kq = 10. The fact that |ka| > 1 means that we need some control 
(feedback or feedforward) to keep the output within its allowed bound (\e| <1) when 
there is a disturbance of magnitude |d| = 1. The fact that |k| > |ka| means that 
we have enough “power” in the inputs to reject the disturbance at steady state, that 
is, we can, using an input of magnitude |u| < 1, have perfect disturbance rejection 
(e = 0) for the mazimum disturbance (\d| = 1). We will return with a detailed 
discussion of this when we consider input-output controllability analysis in Chapter 
5. The room heating process is considered in detail in Section 5.16.2. 


1.6 Notation 


There is no standard notation to cover all of the topics covered in this book. 
We have tried to use the most familiar notation from the literature whenever 
possible, but an overriding concern has been to be consistent within the book, 
to ensure that the reader can follow the ideas and techniques through from 
one chapter to another. 

The most important notation is summarized in Figure 1.3, which shows 
a one degree-of-freedom control configuration with negative feedback, a two 
degrees-of-freedom control configuration, and a general control configuration. 

The latter can be used to represent a wide class of controllers, including 
the one and two degrees-of-freedom configurations, as well as feedforward and 
estimation schemes and many others; and, as we will see, it can also be used 
to formulate optimization problems for controller design. The symbols used 
in Figure 1.3 are defined in Table 1.1. Apart from the use of v to represent 
the controller inputs for the general configuration, this notation is reasonably 
standard. 

Lower-case letters are used for vectors and signals (e.g., u, y, n), and 
capital letters for matrices, transfer functions and systems (e.g., G, K). Matrix 
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(c) General control configuration 


Figure 1.3: Control configurations 
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Table 1.1: Nomenclature 


controller, in whatever configuration. Sometimes the controller is 


broken down into its constituent parts. For example, in the two 
Ky 


degrees-of-freedom controller in Figure 1.3 (b), K = re 
y 


where 


kK, is a prefilter and Ky is the feedback controller. 


For the conventional control configurations (Figure 1.3 (a) and (b)): 


G 
Ga 


r 


d 
n 
y 


Ym 
U 


plant model 

disturbance model 

reference inputs (commands, setpoints) 

disturbances (process noise) 

measurement noise 

plant outputs. These signals include the variables to be controlled 
(“primary” outputs with reference values r) and possibly some 
additional “secondary” measurements to improve control. Usually 
the signals y are measurable. 

measured y 

control signals (manipulated plant inputs) 


For the general control configuration (Figure 1.3 (c)): 


P 


XR 


generalized plant model. It will include G and Gq and the 
interconnection structure between the plant and the controller. In 
addition, if P is being used to formulate a design problem, then it 
will also include weighting functions. 

exogenous inputs: commands, disturbances and noise 

exogenous outputs; “error” signals to be minimized, e.g., y— Tr. 
controller inputs for the general configuration, e.g., commands, 
measured plant outputs, measured disturbances, etc. For the 
special case of a one degree-of-freedom controller with perfect 
measurements we have v =r — y. 
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elements are usually denoted by lower-case letters, so gj; is the ij’th element in 
the matrix G'. However, sometimes we use upper-case letters G';;, for example if 
G is partitioned so that Gj; is itself a matrix, or to avoid conflicts in notation. 
The Laplace variable s is often omitted for simplicity, so we often write G 
when we mean G(s). 

For state-space realizations we use the standard (A, B, C, D)-notation. That 
is, a system G with a state-space realization (A,B,C,D) has a transfer 
function G(s) = C(sI — A)-1B + D. We sometimes write 


G(s) = Lets | (1.26) 


to mean that the transfer function G(s) has a state-space realization given by 
the quadruple (A, B,C, D). 

For closed-loop transfer functions we use S to denote the sensitivity at the 
plant output, and T to denote the complementary sensitivity. With negative 
feedback, S = (I+L)~1 and T = L(I+L)~1, where L is the transfer function 
around the loop as seen from the output. In most cases L = GK, but if 
we also include measurement dynamics (Ym = Gmy +n) then L = GKG yp. 
The corresponding transfer functions as seen from the input of the plant are 
Lr=KG (or Lr= KGmG), Sr= (I+ Ly)" and T; = Lr(t +Ly7)1. 

To represent uncertainty we use perturbations E (not normalized) or A 
(normalized such that their magnitude is less than one). The nominal plant 
model is G, whereas the perturbed model with uncertainty is denoted G, 
(usually for a set of possible perturbed plants) or G’ (usually for a particular 
perturbed plant). For example, with additive uncertainty we may have G’ = 
G+E,=G+waAaz, where wy is weight representing the magnitude of the 
uncertainty. 

By the right half plane (RHP) we mean the closed right half of the complex 
plane, including the imaginary axis (jw-axis). The left half plane (LHP) is the 
open left half of the complex plane, excluding the imaginary axis. A RHP- 
pole (unstable pole) is a pole located in the right half plane, and thus includes 
poles on the imaginary axis. Similarly, a RHP-zero (“unstable” zero) is a zero 
located in the right half plane. 

We use A’ to denote the transpose of a matrix A, and A” to represent its 
complex conjugate transpose. 


Mathematical terminology 


The symbol 5 is used to denote equal by definition, and is used to denote 
equivalent by definition. 

Let A and B be logic statements. Then the following expressions are 
equivalent: 
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A<=B 

A if B, or: If B then A 

A is necessary for B 
B = A, or: B implies A 

B is sufficient for A 

B only if A 
not A > not B 
The remaining notation, special terminology and abbreviations will be 
defined in the text. 
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2 


CLASSICAL FEEDBACK 
CONTROL 


In this chapter, we review the classical frequency-response techniques for the 
analysis and design of single-loop (single-input single-output, SISO) feedback control 
systems. These loop-shaping techniques have been successfully used by industrial 
control engineers for decades, and have proved to be indispensable when it comes 
to providing insight into the benefits, limitations and problems of feedback control. 
During the 1980’s the classical methods were extended to a more formal method 
based on shaping closed-loop transfer functions, for example, by considering the 
Hoo norm of the weighted sensitivity function. We introduce this method at the end 
of the chapter. 

The same underlying ideas and techniques will recur throughout the book as we 
present practical procedures for the analysis and design of multivariable (multi-input 
multi-output, MIMO) control systems. 


2.1 Frequency response 


On replacing s by jw in a transfer function model G(s) we get the so- 
called frequency response description. Frequency responses can be used to 
describe: 1) a system’s response to sinusoids of varying frequency, 2) the 
frequency content of a deterministic signal via the Fourier transform, and 
3) the frequency distribution of a stochastic signal via the power spectral 
density function. 

In this book we use the first interpretation, namely that of frequency-by- 
frequency sinusoidal response. This interpretation has the advantage of being 
directly linked to the time domain, and at each frequency w the complex 
number G'(jw) (or complex matrix for a MIMO system) has a clear physical 
interpretation. It gives the response to an input sinusoid of frequency w. This 
will be explained in more detail below. For the other two interpretations we 
cannot assign a clear physical meaning to G(jw) or y(jw) at a particular 
frequency — it is the distribution relative to other frequencies which matters 
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then. 

One important advantage of a frequency response analysisof a system is 
that it provides insight into the benefits and trade-offs of feedback control. 
Although this insight may be obtained by viewing the frequency response 
in terms of its relationship between power spectral densities, as is evident 
from the excellent treatment by Kwakernaak and Sivan (1972), we believe 
that the frequency-by-frequency sinusoidal response interpretation is the most 
transparent and useful. 


Frequency-by-frequency sinusoids 


We now want to give a physical picture of frequency response in terms of a 
system’s steady-state response to persistent sinusoids. It is important that 
the reader has this picture in mind when reading the rest of the book. For 
example, it is needed to understand the response of a multivariable system 
in terms of its singular value decomposition. A physical interpretation of the 
frequency response for a stable linear system y = G(s)u is a follows. Apply a 
sinusoidal input signal with frequency w [rad/s] and magnitude up, that is, 


u(t) = uo sin(wt + a) 


The input signal is persistent, that is, it has been applied since t = —oo. 
Then as t > © (i.e., after some initial period when the response is more 
complicated) the steady-state output signal is a sinusoid of the same frequency, 
namely 

y(t) = yo sin wt + 3) 


Here up and yo represent magnitudes and are therefore both non-negative. 
Note that the output sinusoid has a different amplitude yo and is also shifted 
in phase from the input by 


¢2p-a 


Importantly, it can be shown that y,/u, and ¢ can be obtained directly from 
the Laplace transform G(s) after inserting the imaginary number s = jw and 
evaluating the magnitude and phase of the resulting complex number G(jw). 
We have 

yo /to = |Gljw)|; = LG(jw) [rad] (2.1) 


For example, let G(jw) = a+ jb, with real part a = ReG(jw) and imaginary 
part b = ImG(jw), then 


IG(jw)| = Va2+0?; LG(jw) = arctan(b/a) (2.2) 


In words, (2.1) says that after sending a sinusoidal signal through a system 
G(s), the signal’s magnitude is amplified by a factor |G(jw)| and its phase is 
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shifted by /G(jw). In Figure 2.1, this statement is illustrated for the following 
first-order delay system (time in seconds), 


ke~% 


a kh 50. = 27 = 10 2.3 
tTs+1’ ? 97 ( ) 


G(s) = 


At frequency w = 0.2 [rad/s], we see that the output y lags behind the 
input by about a quarter of a period and that the amplitude of the output is 
approximately twice that of the input. More accurately, the amplification is 


IG(jw)| = k// (rw)? +1 = 5//(0w)? +1 = 2.24 


and the phase shift is 


¢ = LG(jw) = — arctan(Tw)—6w = — arctan(10w) —2w = —1.5lrad = —86.4° 


Time [sec] 


Figure 2.1: Sinusoidal response for system G(s) = 5e~7*/(10s + 1) at frequency 
w = 0.2 [rad/s]. 


G(jw) is called the frequency response of the system G(s). It describes 
how the system responds to persistent sinusoidal inputs of frequency w. The 
magnitude of the frequency response, |G(jw)|, being equal to |yo(w)|/|uo(w)|, 
is also referred to as the system gain. Sometimes the gain is given in units of 
dB (decibel) defined as 

A [dB] = 20log,, A (2.4) 


For example, A = 2 corresponds to A = 6.02 dB, and A = V2 corresponds to 
A= 3.01 dB, and A = 1 corresponds to A = 0 dB. 

Both |G(jw)| and ZG(jw) depend on the frequency w. This dependency 
may be plotted explicitly in Bode plots (with w as independent variable) 
or somewhat implicitly in a Nyquist plot (phasor diagramIn Bode plots we 
usually employ a log-scale for frequency and gain and a linear scale for the 
phase. In Figure 2.2, the Bode plots are shown for the system in (2.3). We note 
that in this case both the gain and phase fall monotonically with frequency. 
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Magnitude 


10° 10° 10! 10 10 


10° 10° PS y 


IC 
Frequency [rad/s] 


Figure 2.2: Frequency response (Bode plots) of G(s) = 5e~** /(10s + 1). 


This is quite common for process control applications. The delay 6 only shifts 
the sinusoid in time, and thus affects the phase but not the gain. The system 
gain |G(jw)| is equal to k at low frequencies; this is the steady-state gain and 
is obtained by setting s = 0 (or w = 0). The gain remains relatively constant 
up to the break frequency 1/7 where it starts falling sharply. Physically, the 
system responds too slowly to let high-frequency (“fast”) inputs have much 
effect on the outputs, and sinusoidal inputs with w > 1/7 are attenuated by 
the system dynamics. 

The frequency response is also useful for an unstable plant G(s), which by 
itself has no steady-state response. Let G(s) be stabilized by feedback control, 
and consider applying a sinusoidal forcing signal to the stabilized system. In 
this case all signals within the system are persistent sinusoids with the same 
frequency w, and G(jw) yields as before the sinusoidal response from the input 
to the output of G(s). 

Phasor notation. From Euler’s formula for complex numbers we have that 
e/* =cosz+jsinz. It then follows that sin(wt) is equal to the imaginary part 
of the complex function e/“*, and we can write the time domain sinusoidal 
response in complex form as follows: 


u(t) = upIme)'+®) gives ast 3 co y(t) = yolmet(“!+9) (2.5) 
where 
yo = |G(jw)|uo, B= LG(jw) +a (2.6) 


and |G(jw)| and /G(jw) are defined in (2.2). Now introduce the complex 


numbers vi ” 
u(w) = upe’*,  -y(w) = yoe!® (2.7) 
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where we have used w as an argument because yo and 3 depend on frequency, 
and in some cases so may ug and a. Note that u(w) is not equal to u(s) 
evaluated at s = w nor is it equal to u(t) evaluated at t = w. Since 


G(jw) = |G(jw)| e9440") the sinusoidal response in (2.5) and (2.6) can then 
be written on complex form as follows 


y(wjer a G(jw)u(w)er* (2.8) 


or because the term e/“ appears on both sides 


y(w) = G(jw)u(w) (2.9) 


which we refer to as the phasor notation. At each frequency, u(w), y(w) and 
G(jw) are complex numbers, and the usual rules for multiplying complex 
numbers apply. We will use this phasor notation throughout the book. Thus 
whenever we use notation such as u(w) (with w and not jw as an argument), 
the reader should interpret this as a (complex) sinusoidal signal, u(w)es'. 
(2.9) also applies to MIMO systems where u(w) and y(w) are complex vectors 
representing the sinusoidal signal in each channel and G(jw) is a complex 
matrix. 

Minimum phase systems. For stable systems which are minimum phase 
(no time delays or right-half plane (RHP) zeros) there is a unique relationship 
between the gain and phase of the frequency response. This may be quantified 
by the Bode gain-phase relationship which gives the phase of G (normalized! 
such that G'(0) > 0) at a given frequency wp as a function of |G(jw)| over the 
entire frequency range: 


LG(jwo) = : / emilee)! In ot 


dinw wW— Wo 


dw 
- (2.10) 


Nw) 


The name minimum phase refers to the fact that such a system has the 
minimum possible phase lag for the given magnitude response |G(jw)|. The 
term N(w) is the slope of the magnitude in log-variables at frequency w. In 
particular, the local slope at frequency wo is 


dn |G(jw)| 
N = ——— 2.11 
(0) ( dlnw ar ( ) 
The term In aan in (2.10) is infinite at w = wo, so it follows that ZG(jwy) is 
primarily determined by the local slope N(wo). Also f°. In oreo. aS a 


l The normalization of G(s) is necessary to handle systems such as ; Ls and = which 
have equal gain, are stable and minimum phase, but their phases differ by 180°. Systems 


with integrators may be treated by replacing 4 by where € is a small positive number. 


ste 
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which justifies the commonly used approximation for stable minimum phase 
systems 


LG (jw) © SN (wo) [rad] = 90° - N(wy) 


The approximation is exact for the system G(s) = 1/s” (where N(w) = —n), 
and it is good for stable minimum phase systems except at frequencies close 
to those of resonance (complex) poles or zeros. 

RHP-zeros and time delays contribute additional phase lag to a system when 
compared to that of a minimum phase system with the same gain (hence the 
term non-minimum phase system). For example, the system G(s) = —3+* with 
a RHP-zero at s = a has a constant gain of 1, but its phase is —2 arctan(w/a) 
[rad] (and not 0 [rad] as it would be for the minimum phase system G(s) = 1 
of the same gain). Similarly, the time delay system e~%* has a constant gain 
of 1, but its phase is —w6 [rad]. 


a 
i=) 


Magnitude 


Phase 


Frequency [rad/s] 


Figure 2.3: Bode plots of transfer function L; = 30 ante: The asymptotes 


are given by dotted lines. The vertical dotted lines on the upper plot indicate the 
break frequencies w1, w2 and w3. 


Straight-line approximations. For the design methods used in this book 
it is useful to be able to sketch quickly Bode plots, and in particular the 
magnitude (gain) diagram. The reader is therefore advised to become familiar 
with asymptotic Bode plots (straight-line approximations). For example, for 
a transfer function 


Gig ee (2.12) 

(8 + pi) (8 + pa) °° 
the asymptotic Bode plots of G(jw) are obtained by approximating each 
term s+a by jw+aws a for w < aand by juw+a & jw for w > a. 
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These approximations yield straight lines on a log-log plot which meet at the 
so-called break point frequency w = a. In (2.12) therefore, the frequencies 


21, 22,-+--,P1,P2,--. are the break points where the asymptotes meet. For 
complex poles or zeros, the term s? + 2¢swo + w3 (where |¢| < 1) is 
approximated by w? for w < wo and by s? = (jw)? = —w? for w > uy. 


The magnitude of a transfer function is usually close to its asymptotic value, 
and the only case when there is significant deviation is around the resonance 
frequency wo for complex poles or zeros with a damping |¢| of about 0.3 or 
less. In Figure 2.3, the Bode plots are shown for 


(s +1) 
(s + 0.01)?(s + 10) 


I1(s) = 30 (2.13) 


The asymptotes are shown by dotted lines. We note that the magnitude 
follows the asymptotes closely, whereas the phase does not. In this example 
the asymptotic slope of L; is 0 up to the first break frequency at 0.01 rad/s 
where we have two poles and then the slope changes to N = —2. Then at 
1 rad/s there is a zero and the slope changes to N = —1. Finally, there is 
a break frequency corresponding to a pole at 10 rad/s and so the slope is 
N = —2 at this and higher frequencies. 


2.2 Feedback control 


Figure 2.4: Block diagram of one degree-of-freedom feedback control system. 
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2.2.1 One degree-of-freedom controller 


In most of this chapter, we examine the simple one degree-of-freedom negative 
feedback structure shown in Figure 2.4. The input to the controller K(s) is 
T —Ym Where Ym = y +n is the measured output and n is the measurement 
noise. Thus, the input to the plant is 


u= K(s)(r—y-—n) (2.14) 


The objective of control is to manipulate u (design A’) such that the control 
error € remains small in spite of disturbances d. The control error e is defined 
as 

e=y-r (2.15) 


where r denotes the reference value (setpoint) for the output. Note that we 
do not define e as the controller input r — ym which is frequently done. 


2.2.2 Closed-loop transfer functions 
The plant model is written as 
y = G(s)u+Ga(s)d (2.16) 


and for a one degree-of-freedom controller the substitution of (2.14) into (2.16) 
yields 


y =GK(r—y—n) + Gad 


or 
(1+GK)y=GKr+Gid—GKn (2.17) 


and hence the closed-loop response is 


y= (I14+GK)7GKr +(I+GK)"Gid-(1+GK)GKn (2.18) 
—— e——{’——=S—_— —— 
T Ss T 


The control error is 


e=y-r=—Sr+SGad—Tn (2.19) 


where we have used the fact T—J = —S. The corresponding plant input signal 
is 


u= KSr—KSGad—KSn (2.20) 


The following notation and terminology are used 


L=GK _ l|oop transfer function 
S=(1+GK)'=(I1+L)"1 | sensitivity function 
T =(I1+GK)'!GK =(I+L)"1L complementary sensitivity function 
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We see that S is the closed-loop transfer function from the output disturbances 
to the outputs, while T is the closed-loop transfer function from the reference 
signals to the outputs. The term complementary sensitivity for T follows from 
the identity: 

S4+T=I (2.21) 


To derive (2.21), write $+ T = (I+ L)-14+(1+L)"1L and factor out the 
term (I + L)~!. The term sensitivity function is natural because S gives the 
sensitivity reduction afforded by feedback. To see this, consider the “open- 
loop” case i.e. with no feedback. Then 


y=GKr+Gad+0-n (2.22) 


and a comparison with (2.18) shows that, with the exception of noise, the 
response with feedback is obtained by premultiplying the right hand side by 
S. 


Remark 1 Actually, the above is not the original reason for the name “sensitivity” . 
Bode first called S sensitivity because it gives the relative sensitivity of the closed- 
loop transfer function T to the relative plant model error. In particular, at a given 
frequency w we have for a SISO plant, by straightforward differentiation of T, that 


aT /T 

—_— =S 2.23 
dG/G ned 
Remark 2 Equations (2.14)-(2.22) are written in matrix form because they also 
apply to MIMO systems. Of course, for SISO systems we may write $+ 7 = 1, 


ae mews 
S = ay, T = pz and s0 on. 


Remark 3 In general, closed-loop transfer functions for SISO systems with 
negative feedback may be obtained from the rule 
“direct” 
OUTPUT = ————_.- INPUT (2.24) 
1 of “loop” 
where “direct” represents the transfer function for the direct effect of the input on 
the output (with the feedback path open) and “loop” is the transfer function around 
the loop (denoted L(s)). In the above case L = GK. If there is also a measurement 
device, G(s), in the loop, then L(s) = GKG,. The rule in (2.24) is easily derived 
by generalizing (2.17). In Section 3.2, we present a more general form of this rule 
which also applies to multivariable systems. 


2.2.3. Why feedback? 


At this point it is pertinent to ask why we should use feedback control at 
all — rather than simply using feedforward control. A “perfect” feedforward 
controller is obtained by removing the feedback signal and using the controller 
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K(s) = G1‘(s) (we assume for now that it is possible to obtain and 
physically realize such an inverse, although this may of course not be true). 
We also assume that the plant and controller are both stable and that all the 
disturbances are known, that is, we know Gud, the effect of the disturbances 
on the outputs. Then with r — Gad as the controller input, this feedforward 
controller would yield perfect control assuming G was a perfect model of the 
plant: 

y= Gu+Gad= GK(r = Gad) +Gad=r (2.25) 


Unfortunately, G is never an exact model, and the disturbances are never 
known exactly. The fundamental reasons for using feedback control are 
therefore the presence of 


1. Signal uncertainty - Unknown disturbance 
2. Model uncertainty 
3. An unstable plant 


The third reason follows because unstable plants can only be stabilized by 
feedback (see internal stability in Chapter 4). 

The ability of feedback to reduce the effect of model uncertainty is of 
crucial importance in controller design. One strategy for dealing with model 
uncertainty is to approximate its effect on the feedback system by adding 
fictitious disturbances or noise. For example, this is the only way of handling 
model uncertainty within the so-called LQG approach to optimal control (see 
Chapter 9). Is this an acceptable strategy? In general, the answer is no. This 
is easily illustrated for linear systems where the addition of disturbances does 
not affect system stability, whereas model uncertainty combined with feedback 
may easily create instability. For example, consider a perturbed plant model 
G, =G+E where E represents additive model uncertainty. Then the output 
of the perturbed plant is 


y= Gu + dy + dg; dy, = Eu, dy = Gad (2.26) 
where y is different from what we ideally expect (namely Gu) for two reasons: 


1. Uncertainty in the model (d;). 
2. Signal uncertainty (d2) 


In LQG control we set wg = d,; +d where wg is assumed to be an independent 
variable such as white noise. Then in the design problem we may make wg 
large by selecting appropriate weighting functions, but its presence will never 
cause instability. However, in reality wa = Eu-+ dz, so wa depends on the 
signal u and this may cause instability in the presence of feedback when u 
depends on y. Specifically, the closed-loop system (I + (G+ E)K)~1 may be 
unstable for some F # 0. In conclusion, it may be important to explicitly take 
into account model uncertainty when studying feedback control. 
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2.3. Closed-loop stability 


One of the main issues in designing feedback controllers is stability. If the 
feedback gain is too large, then the controller may “overreact” and the closed- 
loop system becomes unstable. This is illustrated next by a simple example. 


=< T 1 1 1 1 
f \ I \ ! 


Ke = 3 (Unstable) 


0 5 10 15 20 25 30 35 40 45 50 
Time [sec] 


Figure 2.5: Effect of proportional gain K. on the closed-loop response y(t) of the 
inverse response process. 


Example 2.1 Inverse response process. Consider the plant (time in seconds) 


3(—28 +1) 


Gls) = 54 alos +1) 


(2.27) 


This is one of two main example processes used in this chapter to illustrate the 
techniques of classical control. The model has a right-half plane (RHP) zero at 


s = 0.5 [rad/s]. This imposes a fundamental limitation on control, and high 
controller gains will induce closed-loop instability. 
This is illustrated for a proportional (P) controller K(s) = Ke. in Figure 2.5, 


where the response y = Tr =GK.(1+ GK.) 'r to a step change in the reference 
(r(t) = 1 fort > 0) ts shown for four different values of K-. The system is seen 
to be stable for K. < 2.5, and unstable for K. > 2.5. The controller gain at the 
limit of instability, Ky = 2.5, is sometimes called the ultimate gain and for this 
value (Ke = Ku) the system is seen to cycle continuously with a period Py = 15.2s, 


corresponding to the frequency wx S 2n/P, = 0.42 [rad/s]. 
Two methods are commonly used to determine closed-loop stability: 


1. The poles of the closed-loop system are evaluated. That is, the zeros of 
1+ L(s) = 0 are found where L is the transfer function around the loop. 
The system is stable if and only if all the closed-loop poles are in the open 
left-half plane (LHP) (that is, poles on the imaginary axis are considered 
“unstable” ). The poles are also equal to the eigenvalues of the state-space 
A-matrix, and this is usually how the poles are computed numerically. 
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2. The frequency response (including region in frequencies) of L(jw) is 
plotted in the complex plane and the number of encirclements it makes 
of the critical point —1 is counted. By Nyquist’s stability criterion (as is 
illustrated in Figure 2.12 and for which a detailed statement is given in 
Theorem 4.14) closed-loop stability is inferred by equating the number 
of encirclements to the number of open-loop unstable poles (RHP-poles). 
For open-loop stable systems where /L(jw) falls with frequency such that 
LL(jw) crosses —180° only once (from above at frequency w1g9), one may 
equivalently use Bode’s stability condition which says that the closed-loop 
system is stable if and only if the loop gain |L| is less than 1 at this 
frequency, that is 


Stability © |L(jw1so)| <1 (2.28) 
where w1g9 is the phase crossover frequency defined by 2 L(jwigo) = —180°. 


Method 1, which involves computing the poles, is best suited for numerical 
calculations. However, time delays must first be approximated as rational 
transfer functions, e.g., Padé approximations. Method 2, which is based on 
the frequency response, has a nice graphical interpretation, and may also be 
used for systems with time delays. Furthermore, it provides useful measures 
of relative stability and forms the basis for several of the robustness tests used 
later in this book. 


Example 2.2 Stability of inverse response process with proportional 

control. Let us determine the condition for closed-loop stability of the plant G in 

(2.27) with proportional control, that is, with K(s) = Ke. and L(s) = K-G(s). 

1. The system is stable if and only if all the closed-loop poles are in the LHP. The 
poles are solutions to 1+ L(s) =0 or equivalently the roots of 


(5s + 1)(10s + 1) + K.3(—28 + 1) =0 


& 50s? + (15 —6K.)s + (1+3K-) =0 (2.29) 


But since we are only interested in the half plane location of the poles, it is not 
necessary to solve (2.29). Rather, one may consider the coefficients ai of the 
characteristic equation ans” +---a18 + a0 = 0 in (2.29), and use the Routh- 
Hurwitz test to check for stability. For second order systems, this test says that 
we have stability if and only if all the coefficients have the same sign. This yields 
the following stability conditions 


(15—6K.) >0; (143K) >0 


or equivalently —1/3 < Ke < 2.5. With negative feedback (Kc > 0) only the 
upper bound is of practical interest, and we find that the maximum allowed gain 
(“ultimate gain”) is Ky, = 2.5 which agrees with the simulation in Figure 2.5. 
The poles at the onset of instability may be found by substituting K. = Ky = 2.5 
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into (2.29) to get 50s° +.8.5 =0, i.e., s = +74/8.5/50 = +50.412. Thus, at the 
onset of instability we have two poles on the imaginary azis, and the system will 
be continuously cycling with a frequency w = 0.412 [rad/s] corresponding to a 
period P, = 27/w = 15.2 s. This agrees with the simulation results in Figure 2.5. 


Magnitude 


W180 


—270 : ; 5 = 
10 1 


0 10 10! 
Frequency [rad/s] 


Figure 2.6: Bode plots for L(s) = ogee ste with K, =1. 

2. Stability may also be evaluated from the frequency response of L(s). A graphical 
evaluation is most enlightening. The Bode plots of the plant (i.e. L(s) with 
Ke = 1) are shown in Figure 2.6. From these one finds the frequency w1so 
where ZL is —180° and then reads off the corresponding gain. This yields 
|L(jwiso)| = K-|G(jwiso)| = 0.4K., and we get from (2.28) that the system is 
stable if and only if K. < 2.5 (as found above). Alternatively, the phase crossover 
frequency may be obtained analytically from: 


ZL(jwiso) = — arctan(2wigo) — arctan(5wis0) — arctan(10wigo) = —180° 
which gives wigo = 0.412 [rad/s] as found in the pole calculation above. The loop 
gain at this frequency is 

3+ \/(2wia0)? +1 _ 
/(5wiso)? + 1+ \/(10wi80)? + 1 


which is the same as found from the graph in Figure 2.6. The stability condition 
|L(jwiso)| <1 then yields K. < 2.5 as expected. 


|L(jwiso)| = Ke 


0.4K. 


2.4 Evaluating closed-loop performance 


Although closed-loop stability is an important issue, the real objective of 
control is to improve performance, that is, to make the output y(t) behave 
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in a more desirable manner. Actually, the possibility of inducing instability is 
one of the disadvantages of feedback control which has to be traded off against 
performance improvement. The objective of this section is to discuss ways of 
evaluating closed-loop performance. 


2.4.1. Typical closed-loop responses 


The following example which considers proportional plus integral (PI) control 
of the inverse response process in (2.27), illustrates what type of closed-loop 
performance one might expect. 


Example 2.3 PI-control of inverse response process. We have already 
studied the use of a proportional controller for the process in (2.27). We found that 
a controller gain of K, = 1.5 gave a reasonably good response, except for a steady- 
state offset (see Figure 2.5). The reason for this offset is the nonzero steady-state 
sensitivity function, S(0) = TR tO = 0.18 (where G(0) = 3 is the steady-state 
gain of the plant). From e = —Sr it follows that for r =1 the steady-state control 
error is —0.18 (as is confirmed by the simulation in Figure 2.5). To remove the 
steady-state offset we add integral action in the form of a PI-controller 


1 

K(s) = Ke (1 + —) (2.30) 
TIS 

The settings for K. and tr can be determined from the classical tuning rules of 

Ziegler and Nichols (1942): 


Ke = Ku 2.2, tr = Puf.2 (2.31) 


where K, us the mazimum (ultimate) P-controller gain and P, is the corresponding 
period of oscillations. In our case Ky, = 2.5 and P, = 15.28 (as observed from the 
simulation in Figure 2.5), and we get K. = 1.14 and 77 = 12.78. Alternatively, Ku 
and P, can be obtained from the model G(s), 


w =1/|G(jwu)|, Pa = 20 /wa (2.32) 


where w, is defined by LG(jwx) = —180°. 

The closed-loop response, with PI-control, to a step change in reference is shown in 
Figure 2.7. The output y(t) has an initial inverse response due to the RHP-zero, but 
it then rises quickly and y(t) =0.9 att = 8.0 s (the rise time). The response is quite 
oscillatory and it does not settle to within £5 % of the final value until after t = 65 s 
(the settling time). The overshoot (height of peak relative to the final value) is about 
62% which is much larger than one would normally like for reference tracking. The 


decay ratio, which is the ratio between subsequent peaks, is about 0.35 which is also 
a bit large. (However, for disturbance rejection the controller settings may be more 
reasonable, and one can always add a prefilier to improve the response for reference 
tracking, resulting in a two degrees-of-freedom controller ). 

The corresponding Bode plots for L, S and T are shown in Figure 2.8. Later, in 
Section 2.4.3, we define stability margins and from the plot of L(jw), repeated in 
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Time [sec] 


Figure 2.7: Closed-loop response to a step change in reference for the inverse 
response process with PlI-control. 


Figure 2.11, we find that the phase margin (PM) is 0.34 rad = 19.4° and the gain 
margin (GM) is 1.63. These margins are too small according to common rules of 
thumb. The peak value of |S| is Ms = 3.92, and the peak value of |T| is Mr = 3.35 
which again are high according to normal design rules. 


Exercise 2.1 Use (2.32) to compute K, and P,, for the process in (2.27). 


In summary, for this example, the Ziegler-Nichols’ PI-tunings are somewhat 
“aggressive” and give a closed-loop system with smaller stability margins and 
a more oscillatory response than would normally be regarded as acceptable. 


2.4.2 Time domain performance 


Step response analysis. The above example illustrates the approach often 
taken by engineers when evaluating the performance of a control system. That 
is, one simulates the response to a step in the reference input, and considers 
the following characteristics (see Figure 2.9): 


e Rise time (t,) : the time it takes for the output to first reach 90% of its 
final value, which is usually required to be small. 

e Settling time (t;) : the time after which the output remains within +5% of 
its final value, which is usually required to be small. 

e Qvershoot : the peak value divided by the final value, which should typically 
be 1.2 (20%) or less. 

e Decay ratio : the ratio of the second and first peaks, which should typically 
be 0.3 or less. 

e Steady-state offset : the difference between the final value and the desired 
final value, which is usually required to be small. 


The rise time and settling time are measures of the speed of the response , 
whereas the overshoot, decay ratio and steady-state offset are related to the 
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Figure 2.8: Typical Bode magnitude and phase plots of L= GK, S and T. 


G(s) = a: K(s) = 1.136(1 + z4;) (Ziegler-Nichols PI controller). 


quality of the response . Another measure of the quality of the response is: 


e Excess variation : the total variation (TV) divided by the overall change at 
steady state, which should be as close to 1 as possible. 


The total variation is the total movement of the output as illustrated in 
Figure 2.10. For the cases considered here the overall change is 1, so the excess 
variation is equal to the total variation. Note that the step response is equal 
to the integral of the corresponding impulse response (e.g., set u(7) = 1 in 
4.11). Some thought then reveals that one can compute the total variation as 
the integrated absolute area (1-norm) of the corresponding impulse response 
(Boyd and Barratt, 1991, p. 98). That is, let y = Tr, then the total variation 


T 
15 Overshoot = A_ | 
: Decay ratio = B/A 


Time 


Figure 2.9: Characteristics of closed-loop response to step in reference. 
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in y for a step change in r is 


TV = [ ” lgr(r)lar 2 llgr(t)lh (2.33) 


where gr(t) is the impulse response, i.e., y(t) resulting from an impulse change 
in r(t). 


Time 


Figure 2.10: Total variation is TV = es v;, and Excess variation is TV/v0. 


The above measures address the output response, y(t). In addition, one 
should consider the magnitude of the manipulated input (control signal, u), 
which usually should be as small and smooth as possible. If there are important 
disturbances, then the response to these should also be considered. Finally, 
one may investigate in simulation how the controller works if the plant model 
parameters are different from their nominal values. 

Another way of quantifying time domain performance is in terms of some 
norm of the error signal e(t) = y(t) — r(¢). For example, one might use 
the integral squared error (ISE), or its square root which is the 2-norm 
of the error signal, |le(t)|l2 = 4/f>° le(r)|?dr. Note that in this case the 
various objectives related to both the speed and quality of response are 
combined into one number. Actually, in most cases minimizing the 2-norm 
seems to give a reasonable trade-off between the various objectives listed 
above. Another advantage of the 2-norm is that the resulting optimization 
problems (such as minimizing ISE) are numerically easy to solve. One 
can also take input magnitudes into account by considering, for example, 


d= Vio (le)? + R\u(t)|?)dt where Q and R are positive constants. This 


is similar to linear quadratic (LQ) optimal control, but in LQ-control one 
normally considers an impulse rather than a step change in r(t), and e(t) 
normally represents the system states. 
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2.4.3. Frequency domain performance 


The frequency-response of the loop transfer function, L(jw), or of various 
closed-loop transfer functions, may also be used to characterize closed-loop 
performance. Typical Bode plots of L, T and S are shown in Figure 2.8. One 
advantage of the frequency domain compared to a step response analysis, is 
that it considers a broader class of signals (sinusoids of any frequency). This 
makes it easier to characterize feedback properties, and in particular system 
behaviour in the crossover (bandwidth) region. We will now describe some 
of the important frequency-domain measures used to assess performance e.g. 
gain and phase margins, the maximum peaks of S and T, and the various 
definitions of crossover and bandwidth frequencies used to characterize speed 
of response. 


Gain and phase margins 


Magnitude 


=1 0 


IC 
Frequency [rad/s] 
Figure 2.11: Typical Bode plot of Z(jw) with PM and GM indicated 


Let L(s) denote the loop transfer function of a system which is closed-loop 
stable under negative feedback. A typical Bode plot and a typical Nyquist 
plot of L(jw) illustrating the gain margin (GM) and phase margin (PM) are 
given in Figures 2.11 and 2.12, respectively. 

The gain margin is defined as 


GM = 1/|L(jwiso)| (2.34) 


where the phase crossover frequency W1g0 is where the Nyquist curve of L(jw) 
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Figure 2.12: Typical Nyquist plot of L(jw) for stable plant with PM and GM 
indicated. Closed-loop instability occurs if L(jw) encircles the critical point —1. 


crosses the negative real axis between —1 and 0, that is 
LL (jwrso) = —180° (2.35) 


If there is more than one crossing the largest value of |L(jwso)| is taken. On a 
Bode plot with a logarithmic axis for |L|, we have that GM (in logarithms, e.g. 
in dB) is the vertical distance from the unit magnitude line down to |L(jwigo)|, 
see Figure 2.11. The GM is the factor by which the loop gain |G'K (jw)| may 
be increased before the closed-loop system becomes unstable. The GM is thus 
a direct safeguard against steady-state gain uncertainty (error). Typically we 
require GM> 2. If the Nyquist plot of D crosses the negative real axis between 
—1 and —oo then a gain reduction margin can be similarly defined from the 
smallest value of |£(jw1go| of such crossings. 
The phase margin is defined as 


PM = ZL (jw,) + 180° (2.36) 


where the gain crossover frequency we is where |L(jw)| first crosses 1 from 
above, that is 


|L(jwe)|=1 (2.37) 
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The phase margin tells how much negative phase (phase lag) we can add to 
L(s) at frequency w, before the phase at this frequency becomes —180° which 
corresponds to closed-loop instability (see Figure 2.12). Typically, we require 
PM larger than 30° or more. The PM is a direct safeguard against time delay 
uncertainty; the system becomes unstable if we add a time delay of 


Gmax = PM/we (2.38) 


Note that the units must be consistent, so if w, is in rad/s then PM must 
be in radians. It is important to note that decreasing the value of w, (lower 
closed-loop bandwidth, slower response) means that we can tolerate larger 
time delay errors. 


Example 2.4 For the PlI-controlled inverse response process example we have 
PM= 19.4° = 19.4/57.3 [rad] = 0.34 [rad] and we = 0.236 [rad/s]. The allowed 
time delay error is then Omax = 0.34 [rad]/0.236 [rad/s] = 1.44 [s/. 


From the above arguments we see that gain and phase margins provide 
stability margins for gain and delay uncertainty. However, as we show below 
the gain and phase margins are closely related to the peak values of |S(jw)| 
and |T (jw)| and are therefore also useful in terms of performance. In short, the 
gain and phase margins are used to provide the appropriate trade-off between 
performance and stability. 


Exercise 2.2 Prove that the maximum additional delay for which closed-loop 
stability is maintained is given by (2.88). 


Exercise 2.3 Derive the approzimation for Ky =1/|G(jwu)| given in (5.76) for 
a first-order delay system. 


Maximum peak criteria 


The maximum peaks of the sensitivity and complementary sensitivity 
functions are defined as 


Ms = max|S(jw)|; Mr = max |T(jw)| (2.39) 


(Note that Ms = ||S||.. and Mr = ||T||,.o in terms of the 1. norm introduced 
later.) Typically, it is required that Mz is less than about 2 (6 dB) and Mr is 
less than about 1.25 (2 dB). A large value of Mg or Mr (larger than about 4) 
indicates poor performance as well as poor robustness. Since $+ T = I, the 
values of Ms and Mr are close when the peaks are large. For stable plants 
we usually have Ms > Mr, but this is not a general rule. An upper bound on 
Mr has been a common design specification in classical control and the reader 
may be familiar with the use of M-circles on a Nyquist plot or a Nichols chart 
used to determine Mr from L(jw). 
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We now give some justification for why we may want to bound the value 
of Ms. Recall that without control e = y —r = Gad — r, and with feedback 
control e = S(Gad — r). Thus, feedback control improves performance in 
terms of reducing |e| at all frequencies where |.S| < 1. Usually, || is small 
at low frequencies, for example, |5(0)| = 0 for systems with integral action. 
But because all real systems are strictly proper we must at high frequencies 
have that ZL — 0 or equivalently S — 1. At intermediate frequencies one 
cannot avoid in practice a peak value, Msg, larger than 1 (e.g., see the 
argument following (2.40)). Thus, there is an intermediate frequency range 
where feedback control degrades performance, and the value of Ms is a 
measure of the worst-case performance degradation. One may also view Mg as 
a robustness measure, as is now explained. To maintain closed-loop stability 
the number of encirclements of the critical point —1 by L(jw) must not change; 
so we want L to stay away from this point. The smallest distance between 
L(jw) and the -1 point is M5", and therefore for robustness, the smaller M5, 
the better. 

There is a close relationship between these maximums peaks and the gain 
and phase margins. Specifically, for a given Ms we are guaranteed 


M. 1 1 
GM > vee PM > Saree a) > Ms [rad] (2.40) 
For example, with Ms = 2 we are guaranteed GM> 2 and PM> 29.0°. 
Similarly, for a given value of Mr we are guaranteed 


1 1 1 
GM >1+ My PM > Saree SE. > Ma [rad] (2.41) 


and therefore with Mr = 2 we have GM> 1.5 and PM> 29.0°. 


Proof of (2.40) and (2.41:. To derive the GM-inequalities notice that L(jwiso) = 
—1/GM (since GM= 1/|L(jwigo)| and L is real and negative at wigo), from which 


we get 
—1 : 1 
S(jwiso) = 


Gage hee aa 
(jwis0) = Gap i= 


(2.42) 


and the results follow. 
To derive the PM-inequalities in (2.40) and (2.41) consider Figure 2.13 where we 
have |S(jwe)| = 1/|1 + L(j/we)| = 1/| — 1 — L(jwe)| and we obtain 


af 


|S(jwe)| = |T(jwe)| = 2sin(PM/2) 


(2.43 


and the inequalities follow. Alternative formulae, which are sometimes used, follow 


from the identity 2sin(PM/2) = \/2(1 — cos(PM)). 


We note with interest that (2.42) requires || to be larger than 1 at 
frequency wigo. This means that provided wigo exists, that is, L(jw) has 
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Figure 2.13: At frequency w, we see from the figure that |1+ L(jw.)| = 2sin(PM/2). 


more than —180° phase lag at some frequency (which is the case for any 
real system), then the peak of |S(jw) must exceed 1. 

In conclusion, we see that specifications on the peaks of |S(jw)| or |T'(jw) 
(Ms or Mr), can make specifications on the gain and phase margins 
unnecessary. For instance, requiring Ms < 2 implies the common rules of 
thumb GM> 2 and PM> 30°. 


2.4.4 Relationship between time and frequency domain 
peaks 


For a change in reference r, the output is y(s) = T(s)r(s). Is there any 
relationship between the frequency domain peak of T(jw), Mr, and any 
characteristic of the time domain step response, for example the overshoot or 
the total variation? To answer this consider a prototype second-order system 
with complementary sensitivity function 


1 


[Cr 
(s) 7282 +27r¢s +1 


(2.44) 


For underdamped systems with ¢ < 1 the poles are complex and yield 
oscillatory step responses. With r(¢) = 1 (a unit step change) the values of the 
overshoot and total variation for y(t) are given, together with M7 and Ms, 
as a function of ¢ in Table 2.1. From Table 2.1, we see that the total variation 
TV correlates quite well with Mr. This is further confirmed by (A.93) and 
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Table 2.1: Peak values and total variation of prototype second-order system 


Time domain Frequency domain 

¢ Overshoot | Total variation || Mr Ms 
2.0 1 1 1 1.05 
1.5 1 1 1 1.08 
1.0 1 1 1 1.15 
0.8 1.02 1.03 1 1.22 
0.6 1.09 1.21 1.04 1.35 
0.4 1.25 1.68 1.36 1.66 
0.2 1.53 3.22 2.55 2.73 
0.1 1.73 6.39 5.03 5.12 
0.01 1.97 63.7 50.0 50.0 


% MATLAB code (Mu toolbox) to generate Table: 
tau=1;zeta=0.1;t=0:0.01:100; 

T = nd2sys(1,[tau*xtau 2*tauxzeta 1]); S = msub(1,T); 
[A,B,C,D]=unpck(T); y1 = step(A,B,C,D,1,t); 
overshoot=max(y1),tv=sum(abs (diff (y1))) 
Mt=hinfnorm(T,1.e-4) ,Ms=hinfnorm(S,1.e-4) 


(2.33) which together yield the following general bounds: 
Mr <TV<(2n+1)Mr (2.45) 


Here n is the order of T(s), which is 2 for our prototype system in (2.44). Given 
that the response of many systems can be crudely approximated by fairly low- 
order systems, the bound in (2.45) suggests that M7 may provide a reasonable 
approximation to the total variation. This provides some justification for the 
use of Mr (rather than Ms) in classical control to evaluate the quality of the 
response. 


2.4.5 Bandwidth and crossover frequency 


The concept of bandwidth is very important in understanding the benefits 
and trade-offs involved when applying feedback control. Above we considered 
peaks of closed-loop transfer functions, Ms and Mr, which are related to the 
quality of the response. However, for performance we must also consider the 
speed of the response, and this leads to considering the bandwidth frequency 
of the system. In general, a large bandwidth corresponds to a faster rise 
time, since high frequency signals are more easily passed on to the outputs. 
A high bandwidth also indicates a system which is sensitive to noise and to 
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parameter variations. Conversely, if the bandwidth is small, the time response 
will generally be slow, and the system will usually be more robust. 

Loosely speaking, bandwidth may be defined as the frequency range [w1, w>] 
over which control is effective. In most cases we require tight control at steady- 
state so w,; = 0, and we then simply call w. = wg the bandwidth. 

The word “effective” may be interpreted in different ways, and this may 
give rise to different definitions of bandwidth. The interpretation we use is 
that control is effective if we obtain some benefit in terms of performance. For 
tracking performance the error is e = y — r = —Sr and we get that feedback 
is effective (in terms of improving performance) as long as the relative error 
e/r = —S is reasonably small, which we may define to be less than 0.707 in 
magnitude. We then get the following definition: 


Definition 2.1 The (closed-loop) bandwidth, wp, is the frequency where 
|S(jw)| first crosses 1/\/2 = 0.707(= —3 dB) from below. 


Another interpretation is to say that control is effective if it significantly 
changes the output response. For tracking performance, the output is y = Tr 
and since without control y = 0, we may say that control is effective as long 
as T is reasonably large, which we may define to be larger than 0.707. This 
leads to an alternative definition which has been traditionally used to define 
the bandwidth of a control system: The bandwidth in terms of T, wr, is 
the highest frequency at which |T(jw)| crosses 1//2 = 0.707(= —3 dB) from 
above. In most cases, the two definitions in terms of S and T yield similar 
values for the bandwidth. However, as we demonstrate below, the definition in 
terms of T may in some cases be a misleading indicator of closed-loop control 
performance. 

In cases where wg and war differ, the situation is generally as follows. Up 
to the frequency wa, |S| is less than 0.7, and control is effective in terms of 
improving performance. In the frequency range [wg,wg7] control still affects 
the response, but does not improve performance — in most cases we find that 
in this frequency range || is larger than 1 and control degrades performance. 
Finally, at frequencies higher than wyr we have S = 1 and control has no 
significant effect on the response. The situation just described is illustrated in 
Example 2.5 below (see Figure 2.15). 

The gain crossover frequency, we, defined as the frequency where |L(jw-))| 
first crosses 1 from above, is also sometimes used to define closed-loop 
bandwidth. It has the advantage of being simple to compute and usually 
gives a value between wg and war. Specifically, for systems with PM < 90° 
we have 


WB <We < WBT (2.46) 


Proof of (2.46): Note that |L(jw-)| = 1 so |S(jw-)| = |T'(jwe)|. Thus, when PM= 90° 
we get |S(jw-)| = |T'(jwe)| = 0.707 (see (2.43)), and we have wg = w. = war. For 


CLASSICAL CONTROL 41 


PM< 90° we get |.S(jw-)| = |T(jwe)| > 0.707, and since wz is the frequency where 
|S(jw)| crosses 0.707 from below we must have wp < we. Similarly, since war is the 
frequency where |T(jw)| crosses 0.707 from above, we must have war > We. 


Another important frequency is the phase crossover frequency, W1g0, defined 
as the first frequency where the Nyquist curve of L(jw) crosses the negative 
real axis between —1 and 0. From (2.42) we get that wigo > war for 
GM> 2.414, and wig0 < war for GM< 2.414, and since in many cases the 
gain margin is about 2.4 we conclude that wig0 is usually close to wzr. It is 
also interesting to note from (2.42) that at wig9 the phase of T (and of L) 
is —180°, so from y = Tr we conclude that at frequency w go the tracking 
response is completely out of phase. Since as just noted wr is often close to 
w go, this further illustrates that wer may be a poor indicator of the system 
bandwidth. 

In conclusion, wg (which is defined in terms of S) and also w, (in terms of 
L) are good indicators of closed-loop performance, while wr (in terms of T) 
may be misleading in some cases. 


Example 2.5 Comparison of wg and wg as indicators of performance. 
An example where wpr is a poor indicator of performance is the following: 


—s+z : pa te 1, 
s(7s+7242)’ s+2z T7841’ 


z=01, r=1 (2.47) 


For this system, both L and T have a RHP-zero at z = 0.1, and we have GM= 2.1, 
PM= 60.1°, Ms = 1.93 and Mr = 1. We find that wg = 0.036 and we = 0.054 are 
both less than z =0.1 (as one should expect because speed of response is limited by 
the presence of RHP-zeros), whereas war = 1/7 = 1.0 is ten times larger than z. 
The closed-loop response to a unit step change in the reference is shown in Figure 
2.14. The rise time is 31.0 s, which is close to1/wp = 28.0s, but very different from 
1l/wer = 1.0s, illustrating that wp is a better indicator of closed-loop performance 
than wpr. 


0.57 7 
5 <6) | 
> 

-0.5; 7 
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0 3 10 15 20... 25, 35 40 45 50 
Time [sec] 
Figure 2.14: Step response for system T = et ae 
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Figure 2.15: Plot of |S| and |7| for system T = et ae 


The Bode plots of S and T are shown in Figure 2.15. We see that |T| +1 up to 
about wer. However, in the frequency range from wp to wer the phase of T (not 
shown) drops from about —40° to about —220°, so in practice tracking is poor in 
this frequency range. For example, at frequency wiso = 0.46 we have T  —0.9, and 
the response to a sinusoidally varying reference r(t) = sinwigot is completely out of 
phase, i.e., y(t) & —0.9r(t). 


We thus conclude that |T| by itself is not a good indicator of performance; 
we must also consider its phase. The reason is that we want JT ~% 1 in order 
to have good performance, and it is not sufficient that |T| ~ 1. On the other 
hand, |S| by itself is a reasonable indicator of performance; it is not necessary 
to consider its phase. The reason for this is that for good performance we 
want S' close to 0 and this will be the case if |S| ~ 0 irrespective of the phase 


of S. 


2.5 Controller design 


We have considered ways of evaluating performance, but one also needs 
methods for controller design. The Ziegler-Nichols’ method used earlier is 
well suited for on-line tuning, but most other methods involve minimizing 
some cost function. The overall design process is iterative between controller 
design and performance (or cost) evaluation. If performance is not satisfactory 
then one must either adjust the controller parameters directly (for example, 
by reducing K. from the value obtained by the Ziegler-Nichols’ rules) or 
adjust some weighting factor in an objective function used to synthesize the 
controller. 

There exists a large number of methods for controller design and some of 
these will be discussed in Chapter 9. In addition to heuristic function rules and 
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on-line tuning we can distinguish between three main approaches to controller 
design: 


1. Shaping of transfer functions. In this approach the designer specifies 
the magnitude of some transfer function(s) as a function of frequency, and 
then finds a controller which gives the desired shape(s). 


(a) Loop shaping. This is the classical approach in which the magnitude 
of the open-loop transfer function, L(jw), is shaped. Usually no 
optimization is involved and the designer aims to obtain |L(jw)| with 
desired bandwidth, slopes etc. We will look at this approach in detail 
later in this chapter. However, classical loop shaping is difficult to apply 
for complicated systems, and one may then instead use the Glover- 
McFarlane H,, loop-shaping design presented in Chapter 9. The method 
consists of a second step where optimization is used to make an initial 
loop-shaping design more robust. 

(b) Shaping of closed-loop transfer functions, such as 5, T and KS. 
Optimization is usually used, resulting in various Ho optimal control 
problems such as mixed weighted sensitivity; more on this later. 


2. The signal-based approach. This involves time domain problem 

formulations resulting in the minimization of a norm of a transfer function. 
Here one considers a particular disturbance or reference change and then 
one tries to optimize the closed-loop response. The “modern” state-space 
methods from the 1960’s, such as Linear Quadratic Gaussian (LQG) 
control, are based on this signal-oriented approach. In LQG the input 
signals are assumed to be stochastic (or alternatively impulses in a 
deterministic setting) and the expected value of the output variance (or 
the 2-norm) is minimized. These methods may be generalized to include 
frequency dependent weights on the signals leading to what is called the 
Wiener-Hopf (or H2-norm) design method. 
By considering sinusoidal signals, frequency-by-frequency, a signal-based 
H.. optimal control methodology can be derived in which the H,., norm of 
a combination of closed-loop transfer functions is minimized. This approach 
has attracted significant interest, and may be combined with model 
uncertainty representations, to yield quite complex robust performance 
problems requiring p-synthesis; an important topic which will be addressed 
in later chapters. 

3. Numerical optimization. This often involves multi-objective optimiza- 
tion where one attempts to optimize directly the true objectives, such as 
rise times, stability margins, etc. Computationally, such optimization prob- 
lems may be difficult to solve, especially if one does not have convexity. 
Also, by effectively including performance evaluation and controller de- 
sign in a single step procedure, the problem formulation is far more criti- 
cal than in iterative two-step approaches. The numerical optimization ap- 
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proach may also be performed on-line, which might be useful when dealing 
with cases with constraints on the inputs and outputs. On-line optimiza- 
tion approaches such as model predictive control are likely to become more 
popular as faster computers and more efficient and reliable computational 
algorithms are developed. 


2.6 Loop shaping 


In the classical loop-shaping approach to controller design, “loop shape” refers 
to the magnitude of the loop transfer function L = GK as a function of 
frequency. An understanding of how K can be selected to shape this loop 
gain provides invaluable insight into the multivariable techniques and concepts 
which will presented later in the book, and so we will discuss loop shaping in 
some detail in the next two sections. 


2.6.1 Trade-offs in terms of L 


Recall equation (2.19), which yields the closed-loop response in terms of the 
control error e = y — Tr: 


e=—(I4+L)'r4+(1+L)7'Gad— (I+ L)"'Ln (2.48) 
—_ ——" e——=/=—_—_’ 
S S ft 


For “perfect control” we want e = y — r = 0; that is, we would like 
ex0-d+0-r+0-n (2.49) 


The first two requirements in this equation, namely disturbance rejection and 
command tracking, are obtained with S = 0, or equivalently, T ~ I. Since 
S = (I+ L)—', this implies that the loop transfer function L must be large 
in magnitude. On the other hand, the requirement for zero noise transmission 
implies that T ~ 0, or equivalently, S ~ I, which is obtained with L ~ 0. This 
illustrates the fundamental nature of feedback design which always involves a 
trade-off between conflicting objectives; in this case between large loop gains 
for disturbance rejection and tracking, and small loop gains to reduce the 
effect. of noise. 

It is also important to consider the magnitude of the control action u (which 
is the input to the plant). We want u small because this causes less wear and 
saves input energy, and also because u is often a disturbance to other parts 
of the system (e.g. consider opening a window in your office to adjust your 
body temperature and the undesirable disturbance this will impose on the air 
conditioning system for the building). In particular, we usually want to avoid 
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fast changes in u. The control action is given by u = K(r — ym) and we find 
as expected that a small wu corresponds to small controller gains and a small 
L=GK. 

The most important design objectives which necessitate trade-offs in 
feedback control are summarized below: 


1. Performance, good disturbance rejection: needs large controller gains, i.e. 
L large. 

. Performance, good command following: L large. 

. Stabilization of unstable plant: D large. 

Mitigation of measurement noise on plant outputs: D small. 

Small magnitude of input signals: K small and LD small. 

. Physical controller must be strictly proper: K — 0 at high frequencies. 

. Nominal stability (stable plant): L small (because of RHP-zeros and time 
delays). 

8. Robust stability (stable plant): L small (because of uncertain or neglected 

dynamics). 


NO ow ww 


Fortunately, the conflicting design objectives mentioned above are generally 
in different frequency ranges, and we can meet most of the objectives by using 
a large loop gain (|£| > 1) at low frequencies below crossover, and a small 
gain (|L| <1) at high frequencies above crossover. 


2.6.2 Fundamentals of loop-shaping design 


By loop shaping one usually means a design procedure that involves explicitly 
shaping the magnitude of the loop transfer function, |L(jw)|. Here L(s) = 
G(s)K(s) where K(s) is the feedback controller to be designed and G(s) 
is the product of all other transfer functions around the loop, including 
the plant, the actuator and the measurement device. Essentially, to get the 
benefits of feedback control we want the loop gain, |L(jw)|, to be as large 
as possible within the bandwidth region. However, due to time delays, RHP- 
zeros, unmodelled high-frequency dynamics and limitations on the allowed 
manipulated inputs, the loop gain has to drop below one at and above 
some frequency which we call the crossover frequency we. Thus, disregarding 
stability for the moment, it is desirable that |L(jw)| falls sharply with 
frequency. To measure how |L| falls with frequency we consider the logarithmic 
slope N = dln|L|/dlnw. For example, a slope N = —1 implies that |L| drops 
by a factor of 10 when w increases by a factor of 10. If the gain is measured 
in decibels (dB) then a slope of N = —1 corresponds to —20 dB/ decade. The 
value of —N at higher frequencies is often called the roll-off rate. 

The design of L(s) is most crucial and difficult in the crossover region 
between w, (where |£| = 1) and w1g9 (where LZ = —180°). For stability, we at 
least need the loop gain to be less than 1 at frequency wo, i-e., |L(jwiso)| < 1. 
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Thus, to get a high bandwidth (fast response) we want w 89 large, that is, we 
want the phase lag in L to be small. Unfortunately, this is not consistent with 
the desire that |£(jw)| should fall sharply. For example, the loop transfer 
function L = 1/s” (which has a slope N = —n on a log-log plot) has a phase 
ZL = —n-90°. Thus, to have a phase margin of 45° we need ZI > —135°, 
and the slope of |£| cannot exceed N = —1.5. 

In addition, if the slope is made steeper at lower or higher frequencies, then 
this will add unwanted phase lag at intermediate frequencies. As an example, 
consider £)(s) given in (2.13) with the Bode plot shown in Figure 2.3. Here 
the slope of the asymptote of |L| is —1 at the gain crossover frequency (where 
|LZ1(jwe)| = 1), which by itself gives —90° phase lag. However, due to the 
influence of the steeper slopes of —2 at lower and higher frequencies, there is 
a “penalty” of about —35° at crossover, so the actual phase of Ly at w, is 
approximately —125°. 

The situation becomes even worse for cases with delays or RHP-zeros in 
L(s) which add undesirable phase lag to L without contributing to a desirable 
negative slope in L. At the gain crossover frequency w,, the additional phase 
lag from delays and RHP-zeros may in practice be —30° or more. 

In summary, a desired loop shape for |L(jw)| typically has a slope of about 
—1 in the crossover region, and a slope of —2 or higher beyond this frequency, 
that is, the roll-off is 2 or larger. Also, with a proper controller, which is 
required for any real system, we must have that LD = GK rolls off at least 
as fast as G. At low frequencies, the desired shape of |L| depends on what 
disturbances and references we are designing for. For example, if we are 
considering step changes in the references or disturbances which affect the 
outputs as steps, then a slope for |Z| of —1 at low frequencies is acceptable. 
If the references or disturbances require the outputs to change in a ramp-like 
fashion then a slope of —2 is required. In practice, integrators are included in 
the controller to get the desired low-frequency performance, and for offset-free 
reference tracking the rule is that 


e L(s) must contain at least one integrator for each integrator in r(s). 


To see this, let L(s) = L(s)/s"’ where L(0) is nonzero and finite and ny is 
the number of integrators in L(s) — sometimes ny is called the system type. 
Consider a reference signal of the form r(s) = 1/s"". For example, if r(t) is 
a unit step then r(s) = 1/s (n, = 1), and if r(t) is a ramp then r(s) = 1/s? 
(n, = 2). The final value theorem for Laplace transforms is 

lim e(t) = lim se(s) (2.50) 


t—00 s—0 
In our case, the control error is 
1 gMI— Nr 


sre = 


= st + L(s) 
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and to get zero offset (i.e. e(t > oc) = 0) we must from (2.50) require ny > ny, 
and the rule follows. In Section 2.6.4, we discuss how to specify the loop shape 
when disturbance rejection is the primary objective of control. 

In conclusion, one can define the desired loop transfer function in terms of 
the following specifications: 


1. The gain crossover frequency, w,, where |L(jw,)| = 1. 

2. The shape of L(jw), e.g., in terms of the slope of |L(jw)| in certain 
frequency ranges. Typically, we desire a slope of about N = —1 around 
crossover, and a larger roll-off at higher frequencies. The desired slope at 
lower frequencies depends on the nature of the disturbance or reference 
signal. 

3. The system type, defined as the number of pure integrators in L(s). 


Loop-shaping design is typically an iterative procedure where the designer 
shapes and reshapes |L(jw)| after computing the phase and gain margins, 
the peaks of closed-loop frequency responses (Mr and Ms), selected closed- 
loop time responses, the magnitude of the input signal, etc. The procedure is 
illustrated next by an example. 


Example 2.6 Loop-shaping design for the inverse response process. 


Magnitude 


We W180 


3605 = m7 1 
10 10 10 10 


Frequency [rad/s] 


Figure 2.16: Frequency response of L(s) in (2.51) for loop-shaping design with 
K, = 0.05. (GM= 2.92, PM= 54°, w. = 0.15, wiso = 0.43, Ms = 1.75, Mp = 1.11) 


We will now design a loop-shaping controller for the example process in (2.27) 
which has a RHP-zero at s = 0.5. The RHP-zero limits the achievable bandwidth 
and so the crossover region (defined as the frequencies between we and wigo) will 
be at about 0.5 rad/s. We only require the system to have one integrator (type 1 
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Figure 2.17: Response to step in reference for loop-shaping design. 
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Figure 2.18: Magnitude Bode plot of controller (2.52) for loop-shaping design. 


system), and therefore a reasonable approach is to let the loop transfer function have 
a slope of —1 at low frequencies, and then to roll off with a higher slope at frequencies 
beyond 0.5 rad/s. We choose the following loop-shape 


L(s) =3K (aera) 


“s(2s + 1)(0.33s + 1) (2.51) 


The frequency response (Bode plots) of L is shown in Figure 2.16. The asymptotic 
slope of |L| is —1 up to 3 rad/s where it changes to —2. The controller corresponding 
to the loop-shape in (2.51) is 


(10s + 1)(5s + 1) 
“s(2s + 1)(0.33s +1)’ 


K(s) = Ke = 0.05 (2.52) 
The controller has zeros at the locations of the plant poles. This is desired in this case 
because we do not want the slope of the loop shape to drop at the break frequencies 
1/10 = 0.1 [rad/s] and 1/5 = 0.2 [rad/s] just before crossover. The controller 
gain K. was selected to get a reasonable trade-off between speed of response and 
the robustness margin to instability. The phase of L is —90° at low frequency, and 
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at w = 0.5 [rad/s] the additional contribution from the term Sa in (2.51) is 
—90°, so for stability we need we < 0.5 [rad/s]. The selection of Ke = 0.05 yields 
We = 0.15 [rad/s] corresponding to GM= 2.92 and PM=54°. The corresponding time 
response is shown in Figure 2.17. It is seen to be much better than the responses with 
either the simple Pl-controller in Figure 2.7 or with the P-controller in Figure 2.5. 
Figure 2.17 also shows that the magnitude of the input signal is reasonable (assuming 
the signals have been scaled such that we want the input to be less than about 1 in 
magnitude). This means that the controller gain is not too large at high frequencies. 
The magnitude Bode plot for the controller (2.52) is shown in Figure 2.18. It is 
interesting to note that in the crossover region around w = 0.5 [rad/s] the controller 
gain is quite constant, around 1 in magnitude, which is similar to the “best” gain 
found using a P-controller (see Figure 2.5). 


Limitations imposed by RHP-zeros and time delays. 


Based on the above loop-shaping arguments we can now examine how the 
presence of delays and RHP-zeros limit the achievable control performance. 
We have already argued that if we want the loop shape to have a slope of 
—1 around crossover (w,), with preferably a steeper slope before and after 
crossover, then the phase lag of DE at w, will necessarily be at least —90°, 
even when there are no RHP-zeros or delays. Therefore, if we assume that for 
performance and robustness we want a phase margin of about 35° or more, 
then the additional phase contribution from any delays and RHP-zeros at 
frequency w, cannot exceed about —55°. 

First consider a time delay 6. It yields an additional phase contribution of 
—Ow, which at frequency w = 1/6 is —1 rad = —57° (which is more than —55°). 
Thus, for acceptable control performance we need w, < 1/0, approximately. 

Next consider a real RHP-zero at s = z. To avoid an increase in slope caused 
by this zero we place a pole at s = —z such that the loop transfer function 
contains the term oa the form of which is referred to as all-pass since its 
magnitude equals 1 at all frequencies. The phase contribution from the all- 
pass term at w = z/2 is —2arctan(0.5) = —53° (which is close to —55°), so 
for acceptable control performance we need w, < z/2, approximately. 


2.6.3 Inverse-based controller design 


In Example 2.6, we made sure that L(s) contained the RHP-zero of G(s), 
but otherwise the specified L(s) was independent of G(s). This suggests the 
following possible approach for a minimum-phase plant (i.e, one with no RHP- 
zeros or time delays). Select a loop shape which has a slope of —1 throughout 
the frequency range, namely 


Leje=2 (2.53) 
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where w, is the desired gain crossover frequency. This loop shape yields a 
phase margin of 90° and an infinite gain margin since the phase of L(jw) 
never reaches —180°. The controller corresponding to (2.53) is 
We vay 

K(s) = ae (s) (2.54) 
That is, the controller inverts the plant and adds an integrator (1/s). This 
is an old idea, and is also the essential part of the IMC (Internal Model 
Control) design procedure of Morari (Morari and Zafiriou, 1989) which has 
proved successful in many applications. However, there are at least two good 
reasons for why this controller may not be a good choice: 


1. The controller will not be realizable if G(s) has more poles than zeros, and 
may in any case yield large input signals. These problems may be partly 
fixed by adding high-frequency dynamics to the controller. 

2. The loop shape resulting from (2.53) is not generally desirable, unless the 
references and disturbances affect the outputs as steps. This is illustrated 
by the following example. 


Example 2.7 Disturbance process. We now introduce our second main 
example process and control problem in which disturbance rejection is an important 
objective in addition to command tracking. We assume that the plant has been 
appropriately scaled as outlined in Section 1.4. 

Problem formulation. Consider the disturbance process described by 


200 1 100 


CO) = sai oss tie? SO) = Heat 


(2.55) 


with time in seconds. A block diagram is shown in Figure 2.20. The control objectives 
are: 


1. Command tracking: The rise time (to reach 90% of the final value) should be less 
than 0.3 [s] and the overshoot should be less than 5%. 

2. Disturbance rejection: The output in response to a unit step disturbance should 
remain within the range [—1,1] at all times, and it should return to 0 as quickly 
as possible (|y(t)| should at least be less than 0.1 after 3 s). 

3. Input constraints: u(t) should remain within the range [—1, 1] at all times to avoid 
input saturation (this is easily satisfied for most designs). 


Analysis. Since Ga(0) = 100 we have that without control the output response to a 
unit disturbance (d = 1) will be 100 times larger than what is deemed to be acceptable. 
The magnitude |Ga(jw)| is lower at higher frequencies, but it remains larger than 1 
up to wa © 10 [rad/s] (where |Ga(jwa)| = 1). Thus, feedback control is needed up to 
frequency wa, so we need we to be approximately equal to 10 rad/s for disturbance 
rejection. On the other hand, we do not want w-, to be larger than necessary because 
of sensitivity to noise and stability problems associated with high gain feedback. We 
will thus aim at a design with w, = 10 [rad/s]. 
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Inverse-based controller design. We will consider the “inverse-based” design as 
given by (2.53) and (2.54) with we = 10. This yields an unrealizable controller and 
therefore we choose to approximate the plant term (0.05s +1)? by (0.1s+1) and 
then in the controller we let this term be effective over one decade, i.e., we use 
(0.1s + 1)/(0.01s +1) to give the realizable design 


we 10s+1 0.1s5+1 L Gee 0.1ls+1 
s 200 0.01s+1’ . s (0.05s + 1)2(0.01s + 1)’ 


Ko(s) = we = 10 


(2.56) 


1.5 1.5 


0.5 0.5 
0 0 
0 1 2 3 0 1 2 3 
Time [sec] Time [sec] 
(a) Tracking response. (b) Disturbance response. 


Figure 2.19: Responses with “inverse-based” controller Ko(s) for disturbance 
process. 


The response to a step reference is excellent as shown in Figure 2.19 (a). The rise 
time is about 0.16 s and there is no overshoot so the specifications are more than 
satisfied. However, the response to a step disturbance (Figure 2.19 (b)) is much too 
sluggish. Although the output stays within the range [—1,1], it is still 0.75 at t = 3 
s (whereas it should be less than 0.1). Because of the integral action the output does 
eventually return to zero, but it does not drop below 0.1 until after 23 s. 


The above example illustrates that the simple “inverse-based” design 
method where L has a slope of about N = —1 at all frequencies, does 
not always yield satisfactory designs. The objective of the next section is 
to understand why the disturbance response was so poor, and to propose a 
more desirable loop shape for disturbance rejection. 


2.6.4 Loop shaping for disturbance rejection 


At the outset we assume that the disturbance has been scaled such that at each 
frequency |d(w)| < 1, and the main control objective is to achieve |e(w)| < 1. 
With feedback control we have e = y = SGad, so to achieve |e(w)| < 1 for 
|d(w)| = 1 (the worst-case disturbance) we require |SGa(jw)| < 1,Vw, or 
equivalently, 

[1+ DL) > |Gal Vw (2.57) 
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At frequencies where |Ga| > 1, this is approximately the same as requiring 
|L| > |Ga|. However, in order to minimize the input signals, thereby reducing 
the sensitivity to noise and avoiding stability problems, we do not want to use 
larger loop gains than necessary (at least at frequencies around crossover). 
A reasonable initial loop shape Lmin(s) is then one that just satisfies the 
condition 

|Lmin| 7 IGa| (2.58) 


where the subscript min signifies that Lmin is the smallest loop gain to satisfy 
|e(w)| < 1. Since L = GK the corresponding controller with the minimum 
gain satisfies 

|Kmin| © |G~"*Ga| (2.59) 


In addition, to improve low-frequency performance (e.g. to get zero steady- 
state offset), we often add integral action at low frequencies, and use 


S+wW] 


K\|= 
|| =| 


IG~"*Gal (2.60) 


This can be summarized as follows: 


e For disturbance rejection a good choice for the controller is one which 
contains the dynamics (Gy) of the disturbance and inverts the dynamics 
(G) of the inputs (at least at frequencies just before crossover). 

e For disturbances entering directly at the plant output, Ga = 1, and we 
get |Kmin| = |G~*], so an inverse-based design provides the best trade-off 
between performance (disturbance rejection) and minimum use of feedback. 

e For disturbances entering directly at the plant input (which is a common 
situation in practice — often referred to as a load disturbance), we have 


Ga = G and we get |Kmin| = 1, so a simple proportional controller with 
unit gain yields a good trade-off between output performance and input 
usage. 


e Notice that a reference change may be viewed as a disturbance directly 
affecting the output. This follows from (1.17), from which we get that a 
maximum reference change r = R may be viewed as a disturbance d = 1 
with Ga(s) = —R where R is usually a constant. This explains why selecting 
K to be like G~+ (an inverse-based controller) yields good responses to step 
changes in the reference. 


In addition to satisfying |L| ~ |Ga| (eq. 2.58) at frequencies around 
crossover, the desired loop-shape L(s) may be modified as follows: 


1. Around crossover make the slope N of |L| to be about —1. This is to achieve 
good transient behaviour with acceptable gain and phase margins. 

2. Increase the loop gain at low frequencies as illustrated in (2.60) to improve 
the settling time and to reduce the steady-state offset. Adding an integrator 
yields zero steady-state offset to a step disturbance. 
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3. Let L(s) roll off faster at higher frequencies (beyond the bandwidth) in 
order to reduce the use of manipulated inputs, to make the controller 
realizable and to reduce the effects of noise. 


The above requirements are concerned with the magnitude, |L(jw)|. In 
addition, the dynamics (phase) of L(s) must be selected such that the closed- 
loop system is stable. When selecting L(s) to satisfy |L| ~ |Ga| one should 
replace Ga(s) by the corresponding minimum-phase transfer function with the 
same magnitude, that is, time delays and RHP-zeros in Ga(s) should not be 
included in L(s) as this will impose undesirable limitations on feedback. On 
the other hand, any time delays or RHP-zeros in G(s) must be included in 
L = GK because RHP pole-zero cancellations between G(s) and K(s) yield 
internal instability, see Chapter 4. 


Remark. The idea of including a disturbance model in the controller is well known 
and is more rigorously presented in, for example, research on the internal model 
principle (Wonham, 1974), or the internal model control design for disturbances 
(Morari and Zafiriou, 1989). However, our development is simple, and sufficient for 
gaining the insight needed for later chapters. 


Example 2.8 Loop-shaping design for the disturbance process 


d 


Figure 2.20: Block diagram representation of the disturbance process in (2.55) 


Consider again the plant described by (2.55). The plant can be represented by 
the block diagram in Figure 2.20, and we see that the disturbance enters at the plant 
input in the sense that G and Gq share the same dominating dynamics as represented 
by the term 200/(10s + 1). 

Step 1. Initial design. From (2.58) we know that a good initial loop shape looks like 
|Lmin| = |Ga| = || at frequencies up to crossover. The corresponding controller 
is K(s) =G7'Lmin = 0.5(0.058 +1)”. This controller is not proper (i.e, it has more 
zeros than poles), but since the term (0.05s+1)* only comes into effect at 1/0.05 = 20 
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[rad/s], which is beyond the desired gain crossover frequency we = 10 [rad/s], we may 
replace it by a constant gain of 1 resulting in a proportional controller 


K1(s) =0.5 (2.61) 


The magnitude of the corresponding loop transfer function, |Li(jw)|, and the 
response (yi(t)) to a step change in the disturbance are shown in Figure 2.21. This 
simple controller works surprisingly well, and for t < 3s the response to a step change 
in the disturbance response is not much different from that with the more complicated 
inverse-based controller Ko(s) of (2.56) as shown earlier in Figure 2.19. However, 
there is no integral action and yi(t) > 1 as t > oo. 


0 1.5 
L2,Lz 
© 2 Ya 
010 | — 1 
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& 
10° 0.5 
¥3 
5 L3 Yy2 
“2 Ly ,L2 6 
10” 10° 10° 0 1 2 3 
Frequency [rad/s] Time [sec] 
(a) Loop gains. (b) Disturbance responses. 


Figure 2.21: Loop shapes and disturbance responses for controllers Ki, K2 and K3 
for the disturbance process. 


Step 2. More gain at low frequency. To get integral action we multiply the 
controller by the term *+#L, where wr is the frequency up to which the term is 
effective (the asymptotic value of the term is 1 for w > wr). For performance we 
want large gains at low frequencies, so we want wr to be large, but in order to 
maintain an acceptable phase margin (which is 44.7° for controller K1) the term 
should not add too much negative phase at frequency we, s0 wr should not be too 
large. A reasonable value is wr = 0.2w. for which the phase contribution from ster 
is arctan(1/0.2) — 90° = —11° at we. In our case we © 10 [rad/s], so we select the 
following controller 


s+2 
8 


Ko(s) = 0.5 (2.62) 


The resulting disturbance response (y2) in shown in Figure 2.21 satisfies the 
requirement that |y(t)| < 0.1 at time t = 3 s, but y(t) exceeds 1 for a short time. 
Also, the response is slightly oscillatory as might be expected since the phase margin 
is only 31° and the peak values for |S| and |T| are Ms = 2.28 and Mr = 1.89 (see 
Table 2.2). 

Step 3. High-frequency correction. To increase the phase margin and improve 
the transient response we supplement the controller with “derivative action” by 
multiplying Ko(s) by a lead-lag term which is effective over one decade starting 
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at 20 rad/s: 
s+2 0.05541 


s 0.0055 +1 
The corresponding disturbance response (y3) is seen to be faster initially and y3(t) 
stays below 1. 


K3(s) = 0.5 (2.63) 


Table 2.2: Alternative loop-shaping designs for the disturbance process 


Reference Disturbance 
GM PM We Ms | Mr tp ras Ymax | y(t = 3) 
Spec.> ~ 10 <.3 | < 1.05 <1 <0.1 
Ko 9.95 | 72.9° | 11.4 | 1.34 1 16 1.00 0.95 75 
Ky 4.04 | 44.7° | 8.48 | 1.83 | 1.33 21 1.24 1.35 .99 
Ke 3.24 | 30.9° | 8.65 | 2.28 | 1.89 19 1.51 1.27 001 
K3 19.7 | 50.9° | 9.27 | 1.43 | 1.23 16 1.24 0.99 001 


Table 2.2 summarizes the results for the four loop-shaping designs; the inverse- 
based design Ko for reference tracking and the three designs Ki, K2 and K3 
for disturbance rejection. Although controller K3 satisfies the requirements for 
disturbance rejection, it is not satisfactory for reference tracking; the overshoot is 
24% which is significantly higher than the mazimum value of 5%. On the other 
hand, the inverse-based controller Ko inverts the term 1/(10s +1) which is also in 
the disturbance model, and therefore yields a very sluggish response to disturbances 
(the output is still 0.75 att =3 s whereas it should be less than 0.1). 


2.6.5 Two degrees-of-freedom design 


For the disturbance process example we see from Table 2.2 that none of 
the controller designs meet all the objectives for both reference tracking and 
disturbance rejection. The problem is that for reference tracking we typically 
want the controller to look like iq see (2.54), whereas for disturbance 
rejection we want the controller to look like 4G-1Gq, see (2.60). We cannot 
achieve both of these simultaneously with a single (feedback) controller. 

The solution is to use a two degrees-of-freedom controller where the 
reference signal r and output measurement ym are independently treated by 
the controller, rather than operating on their difference r — y,. There exist 
several alternative implementations of a two degrees-of-freedom controller. 
The most general form is shown in Figure 1.3(b) on page 12 where the 
controller has two inputs (r and ym) and one output (u). However, the 
controller is often split into two separate blocks as shown in Figure 2.22 where 
i, denotes the feedback part of the controller and K; a reference prefilter. The 
feedback controller K,, is used to reduce the effect of uncertainty (disturbances 
and model error) whereas the prefilter K, shapes the commands to improve 
performance. In general, it is optimal to design the combined two degrees-of- 


56 MULTIVARIABLE FEEDBACK CONTROL 
d 


Figure 2.22: Two degrees-of-freedom controller. 


freedom controller kK in one step. However, in practice Ky, is often designed 
first for disturbance rejection, and then K; is designed to improve reference 
tracking. This is the approach taken here. 

Let T = L(1+L)~! (with L = GK,) denote the complementary sensitivity 
function for the feedback system. Then for a one degree-of-freedom controller 
y = Tr, whereas for a two degrees-of-freedom controller y = TK,r. If the 
desired transfer function for reference tracking (often denoted the reference 
model) is Trer, then the corresponding ideal reference prefilter K, satisfies 
TK, = tref, OF 

K,(s) = T~+(s)Tret(8) (2.64) 


Thus, in theory we may design K,(s) to get any desired tracking response 
Tret(s). However, in practice it is not so simple because the resulting K;(s) 
may be unstable (if G(s) has RHP-zeros) or unrealizable, and relatively 
uncertain if T(s) is not known exactly. A convenient practical choice of 
prefilter is the lead-lag network 


aye ee (2.65) 
Thlag$ +1 

Here we select ead > Tlag if we want to speed up the response, and Tiead < Tag 

if we want to slow down the response. If one does not require fast reference 

tracking, which is the case in many process control applications, a simple lag 

is often used (with Tieaa = 0). 


Example 2.9 Two degrees-of-freedom design for the disturbance process 


In Example 2.8 we designed a loop-shaping controller K3(s) for the plant in (2.55) 
which gave good performance with respect to disturbances. However, the command 
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Figure 2.23: Tracking responses with the one degree-of-freedom controller (K3) 
and the two degrees-of-freedom controller (K3, K,r3) for the disturbance process. 


tracking performance was not quite acceptable as is shown by y3 in Figure 2.28. 
The rise time is 0.16 s which is better than the required value of 0.3 s, but the 
overshoot is 24% which is significantly higher than the maximum value of 5%. To 
improve upon this we can use a two degrees-of-freedom controller with Ky = Ka, 
and we design K,(s) based on (2.64) with reference model Tre = 1/(0.1s + 1) 
(a first-order response with no overshoot). To get a low-order K,(s), we may 
either use the actual T(s) and then use a low-order approximation of K,r(s), or 
we may start with a low-order approzimation of T(s). We will do the latter. From 
the step response y3 in Figure 2.23 we approrimate the response by two parts; a 
fast response with time constant 0.1 s and gain 1.5, and a slower response with 
time constant 0.5 s and gain —0.5 (the sum of the gains is 1). Thus we use 
T(s)2 8 oS (0.7541) ry» from which (2.64) yields K,(s) = 93s*t 


Ols+1  O.5s¢1 — (.1s+1)(0.5s 0.7s+1° 
Following closed-loop simulations we modified this slightly to arrive at the design 


0.55+1 1 
0.65s+1 0.03s+1 


Ky3(s) = (2.66) 


where the term 1/(0.03s +1) was included to avoid the initial peaking of the input 
signal u(t) above 1. The corresponding tracking response is shown in Figure 2.23. The 
rise time is 0.255 which is still better than the requirement of 0.38, and the overshoot 
is only 2.3%. The disturbance response is the same as curve y3 in Figure 2.21. In 


conclusion, we are able to satisfy all specifications using a two degrees-of-freedom 
controller. 


Loop shaping applied to a flexible structure 


The following example shows how the loop-shaping procedure for disturbance 
rejection, can be used to design a one degree-of-freedom controller for a very 
different kind of plant. 


Example 2.10 Loop shaping for a flexible structure 
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(b) Open-loop and closed-loop 
disturbance responses with K = 
a 


(a) Magnitude plot of |G] = |Gq|. 
Figure 2.24: Flexible structure in (2.67). 


Consider the following model of a flexible structure with disturbances occurring at 

the plant input 
2.58(s? +1) 

(s? + 0.52)(s? + 22) 
From the Bode magnitude plot in Figure 2.24 (a) we see that |Ga(jw)| >> 1 
around the resonance frequencies of 0.5 and 2 [rad/s], so control is needed at these 
frequencies. The dashed line in Figure 2.24 (b) shows the open-loop response to a 
unit step disturbance. The output y(t) is seen to cycle between —2 and 2 (outside 
the allowed range —1 to 1), which confirms that control is needed. From (2.59) 
a controller which meets the specification |y(w)| < 1 for |d(w)| = 1 is given by 
|Kmin (Jw)| = IG"Gal = 1. Indeed the controller 


G(s) = Ga(s) = 


(2.67) 


K(s)=1 (2.68) 


turns out to be a good choice as is verified by the closed-loop disturbance response in 
Figure 2.24 (b) (solid line); the output goes up to about 0.5 and then returns to zero. 
The fact that the choice L(s) = G(s) gives closed-loop stability is not immediately 
obvious since |G| has 4 gain crossover frequencies. However, instability cannot occur 
because /G > —180° at all frequencies. 


2.6.6 Conclusion loop shaping 


The loop-shaping procedure outlined and illustrated by the examples above 
is well suited for relatively simple problems, as might arise for stable plants 
where L(s) crosses the negative real axis only once. Although the procedure 
may be extended to more complicated systems the effort required by the 
engineer is considerably greater. In particular, it may be very difficult to 
achieve stability. 

Fortunately, there exist alternative methods where the burden on the 
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engineer is much less. One such approach is the Glover-McFarlane H.. loop- 
shaping procedure which is discussed in detail in Chapter 9. It is essentially 
a two-step procedure, where in the first step the engineer decides on a loop 
shape, || (denoted the “shaped plant” G,) as outlined in this section, and 
in the second step the optimization provides the necessary phase corrections 
to get a stable and robust design. The method is applied to the disturbance 
process in Example 9.3 on page 9.3. 


Remark. Another design philosophy which deals directly with shaping both the 
gain and phase of L(s) is the quantitative feedback theory (QFT), see Horowitz 
(1991). 


2.7 Shaping closed-loop transfer functions 


In this section, we introduce the reader to shaping of closed-loop transfer 
functions where we synthesize a controller by minimizing an H.,. performance 
objective. The topic is discussed further in Section 3.4.6 and in more detail in 
Chapter 9. 

Specifications directly on the open-loop transfer function L = GK, as in 
the loop-shaping design procedures of the previous section, make the design 
process transparent as it is clear how changes in L(s) affect the controller 
K(s) and vice versa. An apparent problem with this approach, however, is 
that it does not consider directly the closed-loop transfer functions, such as 
S and T, which determine the final response. The following approximations 


apply 


|[Lijw)|)>1 => Set; Texel 


|\Lijw)|K&1 => Sel; TRL 


but in the crossover region where |L(jw)| is close to 1, one cannot infer 
anything about S and T from |L(jw)|. The phase of L(jw) is then crucial. 
For example, |S| and |T'| may experience large peaks if L(jw) is close to —1. 

An alternative design strategy is to directly shape the magnitudes of closed- 
loop transfer functions, such as S(s) and T(s). Such a design strategy can 
be formulated as an H.. optimal control problem, thus automating the 
actual controller design and leaving the engineer with the task of selecting 
reasonable bounds (“weights”) on the desired closed-loop transfer functions. 
Before explaining how this may be done in practice, we discuss the terms Ho, 
and Hp. 
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2.7.1 The terms H,, and H» 


The H.. norm of a stable scalar transfer function f(s) is simply its peak value 
as a function of frequency, that is, 


IIF(s)lloo & max | f(jw)| (2.69) 


Strictly speaking, we should here replace “max” (the maximum value) by 
“sup” (the supremum, the least upper bound). This is because the maximum 
may only be approached as w —> oo and may therefore not actually be 
achieved. However, for engineering purposes there is no difference between 
“sup” and “max”. 

The terms H. norm and H.. control are intimidating at first, and a 
name conveying the engineering significance of H.. would have been better. 
After all, we are simply talking about a design method which aims to press 
down the peak(s) of one or more selected transfer functions. However, the 
term Hoo, which is purely mathematical, has now established itself in the 
control community. To make the term less forbidding, an explanation of its 
background may help. First, the symbol oo comes from the fact that the 
maximum magnitude over frequency may be written as 


foe) 


max |f(jw)| = lim. ({ 


Essentially, by raising | f| to an infinite power we pick out its peak value. Next, 
the symbol H stands for “Hardy space”, and H. in the context of this book 
is the set of transfer functions with bounded oo-norm, which is simply the set 
of stable and proper transfer functions. 

Similarly, the symbol Hz stands for the Hardy space of transfer functions 
with bounded 2-norm, which is the set of stable and strictly proper transfer 
functions. The Hy norm of a strictly proper stable transfer function is defined 
as 


iie)ran) 


loo) 


isto? (Lf Genre) (2.70) 


The factor 1/./2z is introduced to get consistency with the 2-norm of the 
corresponding impulse response, see (4.119). Note that the H2-norm of a 
semi-proper (or bi-proper) transfer function (where lims_,o. f(s) is a nonzero 
constant) is infinite, whereas its Ho. norm is finite. An example of a semi- 
proper transfer function (with an infinite H2 norm) is the sensitivity function 
S=(1+GK)"1. 

We will now outline H,., optimal design, and give an example. We will return 
with more details regarding both H2 and H. optimal control in Chapter 9. 
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2.7.2 Weighted sensitivity 


As already discussed, the sensitivity function S is a very good indicator 
of closed-loop performance, both for SISO and MIMO systems. The main 
advantage of considering S is that because we ideally want S small, it is 
sufficient to consider just its magnitude |.$|; that is, we need not worry about 
its phase. Typical specifications in terms of S include: 


1. Minimum bandwidth frequency w% (defined as the frequency where |S(jw)| 
crosses (0.707 from below). 

. Maximum tracking error at selected frequencies. 

. System type, or alternatively the maximum steady-state tracking error, A. 

. Shape of S over selected frequency ranges. 

. Maximum peak magnitude of S, ||S(jw)|lo <M. 


oR G bo 


The peak specification prevents amplification of noise at high frequencies, 
and also introduces a margin of robustness; typically we select M = 2. 
Mathematically, these specifications may be captured simply by an upper 
bound, 1/|wp(s)|, on the magnitude of S where wp(s) is a weight selected by 
the designer.. The subscript P stands for performance since S is mainly used 
as a performance indicator, and the performance requirement becomes 


|S(jw)| < 1/|we(jw)|, Vw (2.71) 


& |wpS|<1, Ww © [llwpSllo <1 (2.72) 


The last equivalence follows from the definition of the H,. norm, and in words 
the performance requirement is that the H. norm of the weighted sensitivity, 
wpS, must be less than one. In Figure 2.25 (a), an example is shown where 
the sensitivity, |S|, exceeds its upper bound, 1/|wp]|, at some frequencies. 
The resulting weighted sensitivity, |wpS| therefore exceeds 1 at the same 
frequencies as is illustrated in Figure 2.25 (b). Note that we usually do not 
use a log-scale for the magnitude when plotting weighted transfer functions, 
such as |wpS|. 

Weight selection. An asymptotic plot of a typical upper bound, 1/|wp|, 
is shown in Figure 2.26. The weight illustrated may be represented by 


8/M +w} 


2. 
s+wpA 2:0) 


wp(s) = 


and we see that |wp(jw)|~! is equal to A < 1 at low frequencies, is equal 
to M > 1 at high frequencies, and the asymptote crosses 1 at the frequency, 
Ww, which is approximately the bandwidth requirement. For this weight the 
loop shape L = w%,/s yields an S which exactly matches the bound (2.72) 


at frequencies below the bandwidth and easily satisfies (by a factor M) the 
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Figure 2.25: Case where |.S| exceeds its bound 1/|wp|, resulting in ||wpS|lo. > 1. 


bound at higher frequencies. This L has a slope in the frequency range below 
crossover of N = —1. In some cases, in order to improve performance, we 
may want a steeper slope for L (and S) below the bandwidth, and then a 
higher-order weight may be selected. A weight which asks for a slope —2 for 
L at lower frequencies is 


(s/M? + wp)? 


(s + wy, At/2)2 ae 


Wp2(s) = 


The insight gained from the previous section on loop-shaping design is very 
useful for selecting weights. For example, for disturbance rejection we must 
satisfy |SGa(jw)| < 1 at all frequencies (assuming the variables have been 
scaled to be less than 1 in magnitude). It then follows that a good initial choice 
for the performance weight is to let wp(s) look like |Ga(jw)| at frequencies 
where |Gq| > 1. In other cases, one may first obtain an initial controller using 
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Figure 2.26: Inverse of performance weight. Exact and asymptotic plot of 
1/|w p(jw)| in (2.73). 


another design procedure, and the resulting sensitivity |.S(jw)| may then be 
used to select a performance weight for a subsequent .. design. 


Exercise 2.4 Make an asymptotic plot of 1/|wp2| in (2.74) and compare with the 
asymptotic plot of 1/|wp| in (2.73). 


2.7.3. Stacked requirements: mixed sensitivity 


The specification ||wPS||o <1 puts a lower bound on the bandwidth, but not 
an upper one, and nor does it allow us to specify the roll-off of L(s) above the 
bandwidth. To do this one can make demands on another closed-loop transfer 
function, for example, on the complementary sensitivity T= I—S=GKS. 
Also, to achieve robustness or to avoid too large input signals, one may want 
to place bounds on the transfer function KS. 

For instance, one might specify an upper bound 1/|w7| on the magnitude 
of T to make sure that L rolls off sufficiently fast at high frequencies, and 
an upper bound, 1/|wz|, on the magnitude of KS to restrict the size of 
the input signals, u = KS(r — Gad). To combine these “mixed sensitivity” 
specifications, a “stacking approach” is usually used, resulting in the following 
overall specification: 


wpSs 
IN lloo = maxa(N(jw)) <1; N= | wrT (2.75) 
sy WyKS 


We here use the maximum singular value, a(N(jw)), to measure the size of 
the matrix N at each frequency. For SISO systems, N is a vector and a(N) 
is the usual Euclidean vector norm: 


o(N) = J/|weS?? + lwrT 2 + wrk Sl? (2.76) 
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The stacking procedure is selected for mathematical convenience as it does not 
allow us to exactly specify the bounds on the individual transfer functions as 
described above. For example, assume that 0) (K) and ¢2(K) are two functions 
of K (which might represent ¢1(A) = wpS and $2(K) = wrT) and that we 
want to achieve 

Idi] <1 and |do| <1 (2.77) 


This is similar to, but not quite the same as the stacked requirement 


[$1] = Vir IeP <1 (2.78) 


Objectives (2.77) and (2.78) are very similar when either || or |@.| is small, 
but in the worst case when |¢1| = |d2|, we get from (2.78) that |¢,| < 0.707 
and |¢2| < 0.707. That is, there is a possible “error” in each specification 
equal to at most a factor /2 = 3 dB. In general, with n stacked requirements 
the resulting error is at most \/n. This inaccuracy in the specifications is 
something we are probably willing to sacrifice in the interests of mathematical 
convenience. In any case, the specifications are in general rather rough, and 
are effectively knobs for the engineer to select and adjust until a satisfactory 
design is reached. 

After selecting the form of N and the weights, the H. optimal controller 
is obtained by solving the problem 


min ||N(A)||>0 (2.79) 


where K is a stabilizing controller. Let yo = minx ||N(K)||.. denote the 
optimal H.. norm. An important property of H..-optimal controllers is that 
they yield a flat frequency response, that is, ¢(N(jw)) = 4 at all frequencies. 
The practical implication is that, except for at most a factor \/n, the transfer 
functions resulting from a solution to (2.79) will be close to yo times the 
bounds selected by the designer. This gives the designer a mechanism for 
directly shaping the magnitudes of a(S), o(T), (KS), and so on. A good 
tutorial introduction to H .control is given by Kwakernaak (1993). 


Example 2.11 7H. mixed sensitivity design for the disturbance process. 
Consider again the plant in (2.55), and consider an Hoo mized sensitivity S/KS 
design in which 


wukS 


Tt was stated earlier that appropriate scaling has been performed so that the inputs 
should be about 1 or less in magnitude, and we therefore select a simple input weight 
Wu =1. The performance weight is chosen, in the form of (2.73), as 


N= ies (2.80) 


_ 8/M +05. 


M=1. s=10, A=10" 2.81 
s+uxA’ 5, Wp 0, 0 (2.81) 


wpi(s) 
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A value of A=0 would ask for integral action in the controller, but to get a stable 
weight and to prevent numerical problems in the algorithm used to synthesize the 
controller, we have moved the integrator slightly by using a small non-zero value 
for A. This has no practical significance in terms of control performance. The value 
we = 10 has been selected to achieve approzimately the desired crossover frequency 
we of 10 rad/s. The Hoo.-problem is solved with the -toolbox in MATLAB using the 
commands in table 2.3 (see Section 3.8 for more details). 


Table 2.3: MATLAB program to synthesize an H.. controller. 


¥, Uses the Mu-toolbox 

G=nd2sys(1,conv([10 1],conv([0.05 1],[0.05 1])),200); YPlant is G. 
M=1.5; wb=10; A=1.e-4; Wp = nd2sys([1/M wb], [1 wb*A]); Wu = 1; % Weights. 

y 

vA 

% Generalized plant P is found with function sysic: 

y 

vA 

systemnames = ’G Wp Wu’; 

inputvar = ’[ r(1); u(1)]°; 


outputvar = ’[Wp; Wu; r-G]’; 
input_toG = ’[u]’; 
input_toWp = ’[r-G]’; 
input_to_Wu = ’[u]’; 
sysoutname = ’P’; 
cleanupsysic = ’yes’; 

sysic; 

h 

% Find H-infinity optimal controller: 
y 

h 


nmeas=1; nu=1; gmn=0.5; gmx=20; tol=0.001; 
[khinf,ghinf,gopt] = hinfsyn(P,nmeas,nu,gmn,gmx,tol); 
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Figure 2.27: Inverse of performance weight (dashed line) and resulting sensitivity 
function (solid line) for two H.. designs (1 and 2) for the disturbance process. 


For this problem, we achieved an optimal Ho. norm of 1.37, so the weighted 
sensitivity requirements are not quite satisfied (see design 1 in Figure 2.27). 
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(a) Tracking response. (b) Disturbance response. 


Figure 2.28: Closed-loop step responses for two alternative Ho. designs (1 and 2) 
for the disturbance process. 


Nevertheless, the design seems good with Ms = 1.30, Mr = 1.0, GM= 8.04, 
PM= 71.2° and we = 7.22, and the tracking response is very good as shown by 
curve y; in Figure 2.28 (a). The design is actually very similar to the loop-shaping 
design for references, Ko, which was an inverse-based controller. 

However, we see from curve y; in Figure 2.28 (b) that the disturbance response 
is very sluggish. If disturbance rejection is the main concern, then from our earlier 
discussion in Section 2.6.4 this motivates the need for a performance weight that 
specifies higher gains at low frequencies. We therefore try 


(s/M\? +)? 
(s+wRAV?)? ? 


wp2(s) = M =15,wz =10,A=10 ° (2.82) 
The inverse of this weight is shown in Figure 2.27, and is seen from the dashed line 
to cross 1 in magnitude at about the same frequency as weight wp, but it specifies 
tighter control at lower frequencies. With the weight wp2, we get a design with an 
optimal Ho norm of 2.21, yielding Ms = 1.63, Mr = 1.48, GM = 4.76, PM = 43.3° 
and We = 11.34. The design is actually very similar to the loop-shaping design 
for disturbances, K3. The disturbance response is very good, whereas the tracking 
response has a somewhat high overshoot, see curve y2 in Figure 2.28 (a). 

In conclusion, design 1 is best for reference tracking whereas design 2 is best for 
disturbance rejection. Two get a design with both good tracking and good disturbance 
rejection we need a two degrees-of-freedom controller. 


2.8 Conclusion 


The main purpose of this chapter has been to present the classical ideas and 
techniques of feedback control. We have concentrated on SISO systems so 
that insights into the necessary design trade-offs, and the design approaches 
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available, can be properly developed before MIMO systems are considered. We 
introduced the H...-problem based on weighted sensitivity, for which typical 
performance weights are given in (2.73) and (2.74). 


3 


INTRODUCTION TO 
MULTIVARIABLE CONTROL 


In this chapter, we introduce the reader to multi-input multi-output (MIMO) 
systems. We discuss the singular value decomposition (SVD), multivariable control, 
and multivariable RHP-zeros. The need for a careful analysis of the effect of 
uncertainty in MIMO systems is motivated by two examples. Finally we describe a 
general control configuration that can be used to formulate control problems. In this 
chapter we introduce many important topics which are considered in greater detail 
later in the book. It is hoped that the chapter should be accessible also to readers 
with only a course on classical SISO control as their basis. 


3.1 Introduction 


We consider a multi-input multi-output (MIMO) plant with m inputs and | 
outputs. Thus, the basic transfer function model is y(s) = G(s)u(s), where y 
is an! x 1 vector, u is an m x 1 vector and G(s) is an / x m transfer function 
matrix. 

If we make a change in the first input, w,, then this will generally affect 
all the outputs, yi, yo, .-., yt, that is, there is interaction between the inputs 
and outputs. A non-interacting plant would result if wu; only affects y1, ua 
only affects y2 and so on. 

The main difference between a scalar (SISO) system and a MIMO system 
is the presence of directions in the latter. Directions are relevant for vectors 
and matrices, but not for scalars. However, despite the complicating factor 
of directions, most of the ideas and techniques presented in the previous 
chapter on SISO systems may be extended to MIMO systems. The singular 
value decomposition (SVD) provides a useful way of quantifying multivariable 
directionality, and we will see that most SISO results involving the absolute 
value (magnitude) may be generalized to multivariable systems by considering 
the maximum singular value. An exception to this is Bode’s stability condition 
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which has no generalization in terms of singular values. This is related to the 
fact that it is difficult to find a good measure of phase for MIMO transfer 
functions. 

The chapter is organized as follows. We start by presenting some rules 
for determining multivariable transfer functions. Although most of the 
formulae for scalar systems apply, we must exercise some care since matrix 
multiplication is not commutative, that is, in general GK #4 KG. Then we 
introduce the singular value decomposition and show how it may be used to 
study directions in multivariable systems. We also give a brief introduction to 
multivariable control and decoupling. We then consider a simple plant with a 
multivariable RHP-zero and show how the effect of this zero may be shifted 
from one output channel to another. After this we discuss robustness, and 
study two example plants, each 2 x 2, which demonstrate that the simple gain 
and phase margins used for SISO systems do not generalize easily to MIMO 
systems. Finally, we consider a general control problem formulation for use 
later in the book. 

At this point, you may find it useful to browse through Appendix A where 
some important mathematical tools are described. Exercises to test your 
understanding of this mathematics are given at the end of this chapter. 


3.2 Transfer functions for MIMO systems 


(a) Cascade system (b) Positive feedback system 


Figure 3.1: Block diagrams for the cascade rule and the feedback rule. 


The following three rules are useful when evaluating transfer functions for 
MIMO systems 


Cascade rule. For the cascade (series) interconnection of Gy and Gy in 
Figure 3.1 (a), the overall transfer function matrix is G = GoG,. 


Remark. The order of the transfer function matrices in G = G2Gi (from left to 
right) is the reverse of the order in which they appear in the block diagram of 
Figure 3.1 (a) (from left to right). This has led some authors to use block diagrams 
in which the inputs enter at the right hand side. However, in this case the order of 
the transfer function blocks in a feedback path will be reversed compared with their 
order in the formula, so no fundamental benefit is obtained. 
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Feedback rule. With reference to the positive feedback system in 
Figure 3.1 (b) we have v = (I — L)~1u where L = GG, is the transfer 
function around the loop. 


Push-through rule. For matrices of appropriate dimensions 
Gi(I — GoGi)7! = (I — GiG2)'Gh (3.1 


Proof; Equation (3.1) is verified by pre-multiplying both sides by (J — GiG'2) and 
post-multiplying both sides by (J — G2G1). 


Exercise 3.1 Derive the cascade and feedback rules. 


The cascade and feedback rules can be combined into the following MIMO 
rule for evaluating closed-loop transfer functions from block diagrams. 


MIMO Rule: Start from the output and write down the blocks as you 
meet them when moving backwards (against the signal flow), taking the 
most direct path towards the input. If you exit from a feedback loop then 
include a term (I—L)~1 for positive feedback (or (I+ L)—' for negative 
feedback) where L is the transfer function around that loop (evaluated 
against the signal flow starting at the point of exit from the loop). 


Care should be taken when applying this rule to systems with nested loops. 
For such systems it is probably safer to write down the signal equations and 
eliminate internal variables to get the transfer function of interest. The rule 
is best understood by considering an example. 


Figure 3.2: Block diagram corresponding to (3.2). 


Example 3.1 The transfer function for the block diagram in Figure 3.2 is given 
by 
z= (Pit PK (I — Po K)~| Po)w (3.2) 
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To derive this from the MIMO rule above we start at the output z and move 
backwards towards w. There are two branches, one of which gives the term Pi, 
directly. In the other branch we move backwards and meet P2, and then K. We then 
exit from a feedback loop and get aterm (I—L)~' (positive feedback) with L = Pook, 
and finally we meet Pio. 


Exercise 3.2 Use the MIMO rule to derive the transfer functions from u to y and 
from u to z in Figure 3.1(b). Use the push-through rule to rewrite the two transfer 
functions. 


Exercise 3.3 Use the MIMO rule to show that (2.18) corresponds to the negative 
feedback system in Figure 2.4. 


Negative feedback control systems 


Figure 3.3: Conventional negative feedback control system. 


For the negative feedback system in Figure 3.3, we define L to be the loop 
transfer function as seen when breaking the loop at the output of the plant. 
Thus, for the case where the loop consists of a plant G and a feedback 
controller K we have 

L=GK (3.3) 


The sensitivity and complementary sensitivity are then defined as 


s4(r4+b); TAI-S=LII+L) (3.4) 


In Figure 3.3, T is the transfer function from r to y, and S is the transfer 
function from d, to y; also see equations (2.16) to (2.20) which apply to MIMO 
systems. 

S and T are sometimes called the output sensitivity and output 
complementary sensitivity, respectively, and to make this explicit one may 
use the notation Lo a GK, So 4S and To 4 T. This is to distinguish 
them from the corresponding transfer functions evaluated at the input to the 
plant. 

We define Ly; to be the loop transfer function as seen when breaking the 
loop at the input to the plant with negative feedback assumed. In Figure 3.3 


ly 2iKGE (3.5) 
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The input sensitivity and input complementary sensitivity functions are then 
defined as 


Sr=(I4+Ly)71; Tr=I-Sp=L1(I +11)" (3.6) 


In Figure 3.3, —Ty, is the transfer function from dy to u. Of course, for SISO 
systems L; = L, $7; = S, and Ty; =T. 


Exercise 3.4 In Figure 3.3, what transfer function does S; represent? Evaluate 
the transfer functions from di and dz to r—y. 


The following relationships are useful: 


(+L) '+L(0+L)1=8+T=!I (3.7) 

G(I+ KG)" = (I+ GK)"'G (3.8) 
GK(I+GK)1=G(I+KG)"'K = (I+ GK)"'GK (3.9) 
T=L(I+L)-1 =(14+(£)71)7} (3.10) 


Note that the matrices G and K in (3.7)-(3.10) need not be square whereas 
L =GK is square. (3.7) follows trivially by factorizing out the term ([+L)~? 
from the right. (3.8) says that G'S; = SG and follows from the push-through 
tule. (3.9) also follows from the push-through rule. (3.10) can be derived from 
the identity M>'Msz* = (MzM,)!. 

Similar relationships, but with G and K interchanged, apply for the transfer 
functions evaluated at the plant input. To assist in remembering (3.7)-(3.10) 
note that G comes first (because the transfer function is evaluated at the 
output) and then G and K alternate in sequence. A given transfer matrix 
never occurs twice in sequence. For example, the closed-loop transfer function 
G(I + GK)~! does not exist (unless G is repeated in the block diagram, but 
then these G’s would actually represent two different physical entities). 


Remark 1 The above identities are clearly useful when deriving transfer functions 
analytically, but they are also useful for numerical calculations involving state-space 
realizations, e.g. L(s) = C(sI — A)~'B + D. For example, assume we have been 
given a state-space realization for L = GK with n states (so A is an X n matrix) 
and we want to find that of T. Then we can first form S = (1 + L)~' with n states, 
and then multiply it by £ to obtain T = SL with 2n states. However, a minimal 
realization of T has only n states. This may be obtained numerically using model 
reduction, but it is preferable to find it directly using T = I — S, see (3.7). 


Remark 2 Note also that the right identity in (3.10) can only be used to compute 
the state-space realization of T if that of L~! exists, so L must semi-proper with 
D £0 (which is rarely the case in practice). On the other hand, since L is square, 
we can always compute the frequency response of L(jw)~' (except at frequencies 
where L(s) has jw-axis poles), and then obtain T(jw) from (3.10). 
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In Appendix A.6 we present some factorizations of the sensitivity function 
which will be useful in later applications. For example, (A.138) relates the 
sensitivity of a perturbed plant, S’ = (I+ G’K)—!, to that of the nominal 
plant, S = (I+ GK)~!. We have 


oS Sepa), Ee eee (3.11) 


where Eo is an output multiplicative perturbation representing the difference 
between G and G’, and T is the nominal complementary sensitivity function. 


3.3. Multivariable frequency response analysis 


The transfer function G(s) is a function of the Laplace variable s and can be 
used to represent a dynamic system. However, if we fix s = s9 then we may 
view G(s) simply as a complex matrix, which can be analyzed using standard 
tools in matrix algebra. In particular, the choice so = jw is of interest since 
G(jw) represents the response to a sinusoidal signal of frequency w. 


3.3.1 Obtaining the frequency response from G(s) 
G(s) 


Figure 3.4: System G(s) with input d and output y 


The frequency domain is ideal for studying directions in multivariable systems 
at any given frequency. Consider the system G(s) in Figure 3.4 with input d(s) 
and output y(s): 

y(s) = G(s)d(s) (3.12) 


(We here denote the input by d rather than by u to avoid confusion with 
the matrix U used below in the singular value decomposition). In Section 2.1 
we considered the sinusoidal response of scalar systems. These results may 
be directly generalized to multivariable systems by considering the elements 
9:3 (jw) of the matrix G(jw). We have 


© i; (jw) represents the sinusoidal response from input j to output i. 


To be more specific, apply to input channel j a scalar sinusoidal signal given 
by 
d;(t) = djo sin(wt + a;) (3.13) 
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This input signal is persistent, that is, it has been applied since t = —oo. Then 
the corresponding persistent output signal in channel 7 is also a sinusoid with 
the same frequency 

yi(t) = yio sin(wt + fj) (3.14) 


where the amplification (gain) and phase shift may be obtained from the 
complex number g;; (jw) as follows 


Yio 


ie Igis(ju)|, Pr — aj = Logis Je) (3.15) 
jo 


In phasor notation, see (2.7) and (2.9), we may compactly represent the 
sinusoidal time response described in (3.13)-(3.15) by 


Yi (w) = giz (Jw) d; (w) (3.16) 


where 
jw) = dyes. ulw) = yese™ (3.17) 


Here the use of w (and not jw) as the argument of d;(w) and y;(w) implies that 
these are complex numbers, representing at each frequency w the magnitude 
and phase of the sinusoidal signals in (3.13) and (3.14). 

The overall response to simultaneous input signals of the same frequency 
in several input channels is, by the superposition principle for linear systems, 
equal to the sum of the individual responses, and we have from (3.16) 


Yyi(W) = ga (Jw) di (w) + gi2(jw)d2(w = 2.4 jw)d (3.18) 


or in matrix form 


y(w) = G(jw)d(w) (3.19) 
where 
dy (w) yi (w) 
dy(w) yo (w) 
d(w) = dy(w) and (4) = | oc) (3.20) 
dw) m(w) 


represent the vectors of sinusoidal input and output signals. 


Example 3.2 Consider a2x2 multivariable system where we simultaneously apply 
sinusoidal signals of the same frequency w to the two input channels. 


core a eee 2 


76 MULTIVARIABLE FEEDBACK CONTROL 


The corresponding output signal is 


Vos | _ i sin(wt | (3.22) 


ya(t) yoo sin(wt + G2) 


which can be computed by multiplying the complerz matriz G(jw) with the complex 
vector d(w): 


eI F1 1oei@1 
y(w) = G(jw)d(w); y(w) = pes , dw) = Es (3.23) 


3.3.2. Directions in multivariable systems 


For a SISO system y = Gd the gain at a given frequency is simply 


Iyw)| _ [Glie)d6)) _ 0 — era, 
aol ey ay ee 


The gain depends of the frequency w, but since the system is linear it is 
independent of the input magnitude |d(w)|. Things are not quite as simple 
for MIMO systems where the input and output signals are both vectors, and 
we need to “sum up” the magnitudes of the elements in the vector by use of 
some norm, as discussed in Appendix A.5.1. If we select the vector 2-norm, 
the usual measure of length, then at a given frequency w the magnitude of 
the vector input signal is 


Id) lle = D> |dj(w)|? = Gig + Bo + °° (3.24) 


and the magnitude of the vector output signal is 


ly()ll2 = { p> _, Iya =V Yio + Y50 + (3.25) 


The gain of the system G(s) for a particular input signal d(w) is then given 
by the ratio 


IIww)lle _ |GGw)dw)lle _ V¥t0 + 450 
I|d() 2 I|d(w)|l2 V dig + do 


(3.26) 


Again the gain depends on the frequency w and again it is independent of the 
input magnitude ||d(w)||2. However, for a MIMO system there are additional 
degrees of freedom and the gain depends also on the direction of the input d. 
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Example 3.3 For a system with two inputs, d = pede the gain is in general 


different for the following five inputs: 


1 0 0.707 0.707 0.6 
oe | ee H aS Rea a cea ce a 
(which all have the same magnitude ||d||2 = 1 but are in different directions). For 
example, for the 2 x 2 system 
C2 E | (3.27) 


3 2 


(a constant matriz) we get for the five inputs 


_fs _[4 _ [6.36 _ [0.707 _ [0.2 
Y= 1g 92 = 19) 0¥8 = | a54 | 44 = 19 7971795 = | 0.2 


and the 2-norm of these five outputs (i.e. the gain for the five inputs) are 
lly1|l2 = 5.83, |ly2ll2 = 4.47, |lyall2 = 7.30, ||ya||2 = 1.00, ||ys||2 = 0.28 


This dependency of the gain on the input direction is illustrated graphically in 
Figure 3.5 where we we have used the ratio d2o /dio as an independent variable to 
represent the input direction. We see that, depending on the ratio d2o/dio, the gain 
varies between 0.27 and 7.34. 


The maximum value of the gain in (3.26) as the direction of the input is 
varied is the maximum singular value of G, 
_. IGdll2 


a(G) = max = max ||Gd 3.28 
(G) alg ee ll2 (3.28) 


whereas the minimum gain is the minimum singular value of G, 
 Gdllo 
a(G) = mi = mi 
2G) = ee Tala Wate 
We will discuss this in detail below. The last identities in (3.28) and (3.29) 


follow because the gain is independent of the input magnitude for a linear 
system. 


|Gd|l2 (3.29) 


3.3.3. Eigenvalues are a poor measure of gain 


Before discussing the singular values we want to demonstrate that the 
magnitudes of the eigenvalues of a transfer function matrix, e.g. |A;(G(jw)|, 
do not provide a useful means of generalizing the SISO gain, |G(jw)|. First of 
all, eigenvalues can only be computed for square systems, and even then they 
can be very misleading. To see this consider the system y = Gd with 


= le al (3.30) 
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a(Gi1) = 7.34 


lly ll2/llall2 
iS 


N 
T 


0 
d2o/dio 


Figure 3.5: Gain ||Gid||2/||d||2 as a function of d2o/dio for G; in (3.27). 


which has both eigenvalues \; equal to zero. However, to conclude from the 
eigenvalues that the system gain is zero is clearly misleading. For example, 
with an input vector d= [0 1]? we get an output vector y = [100 O]?. 

The “problem” is that the eigenvalues measure the gain for the special case 
when the inputs and the outputs are in the same direction, namely in the 
direction of the eigenvectors. To see this let ¢; be an eigenvector of G and 
consider an input d = ¢;. Then the output is y = Gt; = yt; where \; is the 
corresponding eigenvalue, and so |\;| measures the gain in the direction tj. 
This may be useful for stability analysis, but not for performance. 

To find useful generalizations of |G| for the case when G is a matrix, we 
need the concept of a matrix norm, denoted ||G||. Two important properties 
which must be satisfied for a matrix norm are the triangle inequality 


|G1 + Gall < [|G@il| + [Go| 
and the multiplicative property 
|G1Gol| < |Gill - Gall 


(see Appendix A.5 for more details). As we may expect, the magnitude of the 


largest eigenvalue, p(G) 4 |Amax(G)| (the spectral radius), does not satisfy 
the properties of a matrix norm, also see (A.113). 

In Appendix A.5.2 we introduce several matrix norms, such as the Frobenius 
norm |/G||7, the sum norm ||G||sum, the maximum column sum ||G|li1, the 
maximum row sum ||G'|;.0, and the maximum singular value ||G'|i2 = o(G) 
(the latter three norms are induced by a vector norm, e.g. see (3.28); this is 
the reason for the subscript 7). We will use all of these norms in this book, 
each depending on the situation. However, in this chapter we will mainly use 
the induced 2-norm, a(G). Notice that (G) = 100 for the matrix in (3.30). 


Exercise 3.5 Compute the spectral radius and the five matriz norms mentioned 
above for the matrices in (8.27) and (3.80). 
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3.3.4 Singular value decomposition 


The singular value decomposition (SVD) is defined in Appendix A.3. Here 
we are interested in its physical interpretation when applied to the frequency 
response of a MIMO system G(s) with m inputs and | outputs. 

Consider a fixed frequency w where G(jw) is a constant | x m complex 
matrix, and denote G(jw) by G for simplicity. Any matrix G may be 
decomposed into its singular value decomposition, and we write 


G=UsVv" (3.31) 
where 


= is an 1 x m matrix with k = min{l,m} non-negative singular values, oj, 
arranged in descending order along its main diagonal; the other entries 
are zero. The singular values are the square roots of the eigenvalues of 
G4G, where G? is the complex conjugate transpose of G. 


oi(G) = \/\i(G4G) (3.32) 


U is an! xl unitary matrix of output singular vectors, u;, 
V is an m x m unitary matrix of input singular vectors, v,;, 


This is illustrated by the SVD of a real 2 x 2 matrix which can be written 


as cee | 3 ° | cos6) +sin 65 (3.33) 


sinO;  cos6, 0 «a —sin6. +cos 6 
a 
U DS) vr 


where the two singular values are given in (A.35). From (3.33) we see that the 
matrices U and V involve rotations and that their columns are orthonormal. 
The singular values are sometimes called the principal values or principal 
gains, and the associated directions are called principal directions. 


Caution. It is standard notation to use the symbol U to denote the matrix of 
output singular vectors. This is unfortunate as it is also standard notation to use 
u (lower case) to represent the input signal. The reader should be careful not to 
confuse these two. When we refer to u; below, then this is the output singular vector 
formed from the i’th column of U. 

Input and output directions. The column vectors of U represent the 

output directions of the plant. They are orthogonal and of unit length 
(orthonormal), that is 


[lei ll2 = V [een |? + uial?... + Jui? = 1 (3.34) 
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ufu;=1, uu; =0, iFj (3.35) 
Likewise, the column vectors of V are orthogonal and of unit length, and 
represent the input directions. These input and output directions are related 


through the singular values. To see this, note that since V is unitary we have 
VV =I, so (3.31) may be written as GV = UY, which for column i becomes 


where v; and u; are vectors, whereas o; is a scalar. That is, if we consider an 
input in the direction v;, then the output is in the direction u;. Furthermore, 
since ||v,;||[2 = 1 and ||u;||2 = 1 we see that the 7’th singular value o; gives 
directly the gain of the matrix G in this direction. In other words 
Gu; 2 
o:(@) = ||Gui|2 = "il (3.37) 


Ilvill2 


Some advantages of the SVD over the eigenvalue decomposition for analyzing 
gains and directionality of multivariable plants are: 


1. The singular values give better information about the gains of the plant. 
2. The plant directions obtained from the SVD are orthogonal. 
3. The SVD also applies directly to non-square plants. 


Maximum and minimum singular value. As already stated, it can be 
shown that the largest gain for any input direction is equal to the maximum 
singular value 


> 


Gd G 

mare Hale _ Grills _ 5 «@ 
#0 |[dll2_—_— [lea fle 

and that the smallest gain for any input direction is equal to minimum singular 

value 


a(G) (3.38) 


_ Gala _ [Goel 
oo [dle als eae) 


where k = min{l,m}. Thus, for any vector d we have that 


<a(G) (3.40) 


Define uy = U,v; = 0, uz, = u and vz, = v. Then it follows that 
Gt = ci, Gu=cu (3.41) 


The vector v corresponds to the input direction with largest amplification, and 
u is the corresponding output direction in which the inputs are most effective. 
The direction involving © and & is sometimes referred to as the “strongest”, 
“high-gain” or “most important” direction. The next most important direction 
is associated with vz and wz, and so on (see Appendix A.3.6) until the “least 
important”, “weak” or “low-gain” direction which is associated with v and u. 
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Example 3.4 Consider again the system (3.27) in Example 3.3, 


5 4 
Gi= 5 | (3.42) 


The singular value decomposition of G1 is 


G,= 0.872 0.490 7.343 0 0.794 —0.608 “ 
‘0.490 —0.872 0 0.272 0.608 0.794 


S—— SS —9§ ee 
U x VA 


0.794 
0.608 


|: This confirms 


The largest gain of 7.348 is for an input in the direction 0 = iF and the 


—0.608 


smallest gain of 0.272 is for an input in the direction v = 0.794 


the findings in Example 3.8. 


Since in (3.42) both inputs affect both outputs, we say that the system is 
interactive. This follows from the relatively large off-diagonal elements in G1. 
Furthermore, the system is ill-conditioned, that is, some combinations of the 
inputs have a strong effect on the outputs, whereas other combinations have a 
weak effect on the outputs. This may be quantified by the condition number, 
the ratio between the gains in the strong and weak directions; which for the 
system in (3.42) is 7.343/0.272 = 27.0. 


Example 3.5 Shopping cart. Consider a shopping cart (supermarket trolley) 
with fired wheels which we may want to move in three directions; forwards, sideways 
and upwards. This is a simple illustrative example where we can easily figure out the 
principal directions from experience. The strongest direction, corresponding to the 
largest singular value, will clearly be in the forwards direction. The next direction, 
corresponding to the second singular value, will be sideways. Finally, the most 
“difficult” or “weak” direction, corresponding to the smallest singular value, will 
be upwards (lifting up the cart). 

For the shopping cart the gain depends strongly on the input direction, i.e., 
the plant is ill-conditioned. Control of ill-conditioned plants is sometimes difficult, 
and the control problem associated with the shopping cart can be described as 
follows: Assume we want to push the shopping cart sideways (maybe we are blocking 
someone). This is rather difficult (the plant has low gain in this direction) so a 
strong force is needed. However, if there is any uncertainty in our knowledge about 
the direction the cart is pointing, then some of our applied force will be directed 
forwards (where the plant gain is large) and the cart will suddenly move forward 
with an undesired large speed. We thus see that the control of an ill-conditioned 
plant may be especially difficult if there is input uncertainty which can cause the 
input signal to “spread” from one input direction to another. We will discuss this in 
more detail later. 
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Example 3.6 Distillation process. Consider the following steady-state model of 


a distillation column 
87.8 —86.4 
GS Re ea G45) 


The variables have been scaled as discussed in Section 1.4. Thus, since the elements 
are much larger than 1 in magnitude this suggests that there will be no problems with 
input constraints. However, this is somewhat misleading as the gain in the low-gain 
direction (corresponding to the smallest singular value) is actually only just above 1. 
To see this consider the SVD of G: 


A 
0.625 —0.781] [197.2 0 0.707 —0.708 
eal a 0 153 Be Beal (3.44) 
———— ~“ 
U >») vi 


From the first input singular vector, v; = [0.707 —0.708], we see that the gain is 
197.2 when we increase one inputs and decrease the other input by a similar amount. 
On the other hand, from the second input singular vector, v2 = [ —0.708 —0.707]", 
we see that if we increase both inputs by the same amount then the gain is only 
1.39. The reason for this is that the plant is such that the two inputs counteract each 
other. Thus, the distillation process is ill-conditioned, at least at steady-state, and 
the condition number is 197.2/1.39 = 141.7. The physics of this example is discussed 
in more detail below, and later in this chapter we will consider a simple controller 
design (see Motivating robustness example No. 2 in Section 3.7.2). 


Example 3.7 Physics of Distillation process. The model in (3.43) represents 
two-point (dual) composition control of a distillation column, where the top 
composition is to be controlled at yn = 0.99 (output yi) and the bottom composition 
at tp = 0.01 (output yo), using reflux L (input ui) and boilup V (input u2) as 
manipulated inputs (see Figure 10.6 on page 454). Note that we have here returned 
to the convention of using u, and uz to denote the manipulated inputs; the output 
singular vectors will be denoted by % and u. 

The 1,1-element of the gain matrix G is 87.8. Thus an increase in ui by 1 (with us 
constant) yields a large steady-state change in y1 of 87.8, that is, the outputs are very 
sensitive to changes in u,. Similarly, an increase in us by 1 (with u, constant) yields 
yi = —86.4. Again, this is a very large change, but in the opposite direction of that 
for the increase in u1. We therefore see that changes in ui and u2 counteract each 
other, and if we increase u1 and uz simultaneously by 1, then the overall steady-state 
change in yi is only 87.8 — 86.4 = 1.4. 

Physically, the reason for this small change is that the compositions in the 
distillation column are only weakly dependent on changes in the internal flows (i.e., 
simultaneous changes in the internal flows L and V ). This can also be seen from the 
smallest singular value, o(G) = 1.39, which is obtained for inputs in the direction 


_ | —0.708 —0.781 
aS ee 0.695 we see that the effect 


is to move the outputs in different directions, that is, to change y1 — y2. Therefore, 
it takes a large control action to move the compositions in different directions, that 
is, to make both products purer simultaneously. This makes sense from a physical 
point of view. 


. From the output singular vector u= 
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On. the other hand, the distillation column is very sensitive to changes in external 
flows which make L and V more different (i.e., increase ui — up = L—V). This 
0.707 
—0.708 
singular value, and is a general property of distillation columns where both products 
are of high purity. The reason for this is that the external distillate flow (which varies 
as V — L) has to be about equal to the amount of light component in the feed, and 
even a small imbalance leads to large changes in the product compositions. 


can be seen from the input singular vector 0 = associated with the largest 


For dynamic systems the singular values and their associated directions 
vary with frequency, and for control purposes it is usually the frequency 
range corresponding to the closed-loop bandwidth which is of main interest. 
The singular values are usually plotted as a function of frequency in a Bode 
magnitude plot with a log-scale for frequency and magnitude. Typical plots 
are shown in Figure 3.6. 


| LL a® 
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a(G) 
1 zal 
10 & 10 
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Frequency [rad/s] Frequency [rad/s] 
(a) Spinning satellite in (3.72). (b) Distillation process in (3.77 


Figure 3.6: Typical plots of singular values 


Nonsquare plants 


The SVD is also useful for nonsquare plants. For example, consider a plant 
with 2 inputs and 3 outputs. In this case the third output singular vector, us, 
tells us in which output direction the plant cannot be controlled. Similarly, for 
a plant with more inputs than outputs, the additional input singular vectors 
tell us in which directions the input will have no effect. 


Exercise 3.6 For a system with m inputs and 1 oulput, what is the interpretation 
of the singular values and the associated input directions (V)? What is U in this 
case? (Answer: V2,...,Um yield the input directions with no effect on the output. 


U=1.) 
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Use of the minimum singular value of the plant 


The minimum singular value of the plant, o(G(jw), evaluated as a function 
of frequency, is a useful measure for evaluating the feasibility of achieving 
acceptable control. If the inputs and outputs have been scaled as outlined 
in Section 1.4, then we can, with an manipulated input of unit magnitude 
(measured with the 2-norm), achieve an output of magnitude of at least o(G) 
in any output direction. We generally want o(G) as large as possible, at at least 
to avoid input saturation we want o(G) larger than about 1 at all frequencies 
where control is required. 


Remark. The relationship between o(G) an input saturation is discussed in 
Section 6.9. In Section 10.3 on selection of controlled outputs it is shown that it may 
be desirable to have o(G(jw) large even when input saturation is not a concern. 
The minimum singular value of the plant to analyze achievable performance was 
discussed by Morari (1983) and Yu and Luyben (1986) call o(G(jw)) the “Morari 
resiliency index”. 


3.3.5 Singular values for performance 


So far we have used the SVD primarily to gain insight into the directionality 
of MIMO systems. But the maximum singular value is also very useful in 
terms of frequency-domain performance and robustness. We here consider 
performance. 

For a one degree-of-freedom feedback control system the closed-loop 
response from references, 7, to control errors, e = y — Tr, is e = —Sr. We have 
previously argued for SISO systems that |S(jw)| evaluated as a function of 
frequency gives useful information about the effectiveness of feedback control, 
because for sinusoids |e(w)|/|r(w)| = |S(jw)|. For MIMO systems a useful 
generalization results if we consider the ratio ||e(w)||2/||r(w)||2 where || - |l2 is 
the vector 2-norm. As explained above, this ratio depends on the direction of 
r(w) and we have from (3.40) 


o(S(jw)) < SO? < 5(S(jw)) (3.45) 


In terms of performance, it is reasonable to require that the ratio 
l|e(w) ||2/||r(w) ||2 remains small for any direction of r(w), including the “worst- 
case” direction which gives the ratio as a(S(jw)). Let at each frequency 
1/|wp(jw)| represent the maximum allowed magnitude of |le||2/||r|l2. This 
results in the following performance requirement: 


a(S(jw)) < 1/|we(jw)|, Vo & G(wpS) <1, Vw 
& |lweS|lo <1 (3.46) 
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where the H.. norm (see page 60) is defined as the peak of the maximum 
singular value of the frequency response 


IIM(s)loo = max o(M (jw)) (3.47) 


By using the #.. norm to define performance most of the ideas presented in 
Chapter 2 can be generalized to MIMO systems. Typical performance weights 
are given in Section 2.7.2, which should be studied carefully. 

The singular values of S(jw) may be plotted as a function of frequency 
as illustrated in Figure 3.10 (a). Typically, they are small at low frequencies 
where feedback is effective, and they approach 1 at high frequencies because 
any real system is strictly proper. The maximum singular value, a(.S(jw)), 
usually has a peak larger than 1 at crossover frequencies. This peak is 
undesirable, but it is unavoidable for real systems. 

As for SISO systems we define the bandwidth as the frequency up to 
which feedback is effective. For MIMO systems the bandwidth will depend 
on directions, and we have a bandwidth region between a lower frequency 
where the maximum singular value, a(S), reaches 0.7 (the low-gain or worst- 
case direction), and a higher frequency where the minimum singular value, 
a(S), reaches 0.7 (the high-gain or best direction). If we want to associate a 
single bandwidth frequency for a multivariable system, then we consider the 
worst-case (low-gain) direction, and define 


e Bandwidth, wg: Frequency where a(S) crosses wa = 0.7 from below. 
It is then understood that the bandwidth is at least wg for any direction of 


the input (reference or disturbance) signal. Since $ = (I+ L)~', (A.51) yields 


iyi ae Sali (3.48) 


Thus at frequencies where feedback is effective (namely where o(L) > 1) we 
have a(S) = 1/a(Z)), and at the bandwidth frequency (where 1/a(S(jwg)) = 
/2 = 1.41) we have that o(L(jwg)) is between 0.41 and 2.41. Thus, the 
bandwidth is approximately where o(L(jw)) crosses 1. Finally, at higher 
frequencies where for any real system o(L) (and a(L)) is small we have that 


a(S) <1. 


3.4 Control of multivariable plants 


Consider the simple feedback system in Figure 3.7. A conceptually simple 
approach to multivariable control is given by a two-step procedure in which 
we first design a “compensator” to deal with the interactions in G, and then 


86 MULTIVARIABLE FEEDBACK CONTROL 
d 


Figure 3.7: One degree-of-freedom feedback control configuration. 


design a diagonal controller using methods similar to those for SISO systems. 
This approach is discussed below. 

The most common approach is to design a pre-compensator, W,(s), which 
counteracts the interactions in the plant and results in a “new” shaped plant: 


Gs(s) = G(s)Wi(s) (3.49) 


which is more diagonal and easier to control than the original plant G(s). 
After finding a suitable W,(s) we can design a diagonal controller K,(s) for 
the shaped plant. The overall controller is then: 


K(s) = Wi(s)Ks(s) (3.50) 


In many cases effective compensators may be derived on physical grounds and 
may include nonlinear elements such as ratios. 

Some design approaches in this spirit are the Nyquist Array technique of 
Rosenbrock (1974) and the characteristic loci technique of MacFarlane and 
Kouvaritakis (1977). In the latter, W, is usually chosen as a constant matrix 
obtained by inverting a real approximation of G(jw) at a given frequency. 


Remark. The . loop-shaping design procedure described in detail in Section 9.4 
is similar in that a precompensator is first chosen to yield a shaped plant, G, = GW, 
with desirable properties, and then a controller K,(s) is designed. The main 
difference is that K,(s) is a full multivariable controller, and secondly it is designed 
based on optimization (to optimize Hoo robust stability). 
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3.4.1 Decoupling 


Decoupling results when the compensator is chosen such that G, in (3.49) is 
diagonal at a selected frequency. The following different cases are possible: 


1. Dynamic decoupling: G's(s) is diagonal (at all frequencies). For example, 
with G,(s) = J and a square plant, we get W1 = G~1(s) (disregarding the 
possible problems involved in realizing G~1+(s)). In this case a reasonable 
overall controller (corresponding to K,(s) = l(s)I with e.g. I(s) = k/s) 
might be given by 


K(s) = Kinv(s) 2 1(s)G-}(s) (3.51) 
We will later refer to (3.51) as an inverse-based controller. It results 
in a decoupled nominal system with identical loops, i.e. D(s) = I(s)I, 
l(s 
S(s) = muy! and T(s) = one 


In some cases we may want to keep the diagonal elements in the shaped 
plant unchanged by selecting W, = G~'Gajag. In other cases we may want 
the diagonal elements in W; to be 1; this may be obtained by selecting 
Wi =G"((G"" diag). 

2. Steady-state decoupling: G'(0) is diagonal. This may be obtained by 
selecting a constant precompensator W,; = G~1(0) (and for a non-square 
plant we may use the pseudo-inverse provided G(0) has full row (output) 
rank). 

3. Approximate decoupling at frequency Wo: Gs(jwo) is as diagonal as possible. 
This is usually obtained by choosing a constant precompensator W, = G5! 
where G, is a real approximation of G(jw,). G, may be obtained, for 
example, using the align algorithm of Kouvaritakis (1974). The bandwidth 
frequency is a good selection for w, because the effect on performance of 
reducing interaction is normally greatest at this frequency. 


The idea of using a decoupling controller is appealing, but there are several 
difficulties: 


1. We cannot in general choose G;, freely. For example, W1(s) must not cancel 
any RHP-zeros in G(s). 

2. If the plant has RHP-zeros then the requirement of decoupling generally 
introduces extra RHP-zeros in the closed-loop system, see Section 6.5.1. 

3. As we might expect, decoupling may be very sensitive to modelling 
errors and uncertainties. This is illustrated in Section 3.7.2 and is further 
discussed in Section 6.10. 

4. The requirement of decoupling may not be desirable for disturbance 
rejection. The reasons are similar to those given for SISO systems in 
Section 2.6.4, and are discussed further below, see (3.54). 
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Even though decoupling controllers may not be desirable in practice, 
they are of interest from a theoretical point of view. They also yield 
insights into the limitations imposed by the multivariable interactions on 
achievable performance. One popular design method, which essentially yields 
a decoupling controller is the internal model control (IMC) approach (Morari 
and Zafiriou, 1989). 

Another common strategy, which avoids most of the problems just 
mentioned, is to use partial (one-way) decoupling where Gs(s) in (3.49) is 
upper or lower triangular. 


3.4.2. Pre- and post-compensators and the 
SVD-controller 


The above pre-compensator approach may be extended by introducing a post- 
compensator W2(s), as shown in Figure 3.8. One then designs a diagonal 


Figure 3.8: Pre- and post-compensator, W; and W2. K, is diagonal. 
controller K, for the shaped plant W2GW,. The overall controller is then 
K(s) =W,K,W2 (3.52) 


The SVD-controller is a special case of a pre- and post-compensator design. 
Here W; = V, and W2 = ur , where V, and U, are obtained from a singular 
value decomposition of Go = Uo"oV,’, where Go is a real approximation 
of G(jw.) at a given frequency w, (often around the bandwidth). SVD- 
controllers are studied by Hung and MacFarlane (1982), and by Hovd, Braatz 
and Skogestad (1994) who found that the SVD controller structure is optimal 
in some cases, e.g. for plants consisting of symmetrically interconnected 
subsystems. 

In summary, the SVD-controller provides a useful class of controllers. By 
by selecting K, =1(s)=;1 a decoupling design is achieved, and by selecting 
a diagonal K, with a low condition number (7(K’,) small) we generally get a 
robust controller (see Section 6.10). 
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3.4.3. Diagonal controller (decentralized control) 


Another simple approach to multivariable controller design is to use a diagonal 
or block-diagonal controller K’(s). This is often referred to as decentralized 
control. Clearly, this works well if G(s) is close to diagonal, because then the 
plant to be controlled is essentially a collection of independent sub-plants, 
and each element in A’(s) may be designed independently. However, if off- 
diagonal elements in G(s) are large, then the performance with decentralized 
diagonal control may be poor because no attempt is made to counteract the 
interactions. 


3.4.4 What is the shape of the “best” feedback 
controller? 


Consider the problem of disturbance rejection. The closed-loop disturbance 
response is y = SGad. Suppose we have scaled the system (see Section 1.4) 
such that at each frequency the disturbances are of magnitude 1, ||d||2 < 1, 
and our performance requirement is that ||y|l2 < 1. This is equivalent to 
requiring @(SGq) <1. In many cases there is a trade-off between input use 
and performance, such that the controller that minimizes the input magnitude 
is one that yields all singular values of SG4q equal to 1, ie., 0;(SGa) = 1, Vw. 
This corresponds to 

SminGa = U1 (3.53) 


where U;(s) is an all-pass transfer function which at each frequency has all 

its singular values equal to 1. The subscript min refers to the use of the 
smallest loop gain that satisfies the performance objective. For simplicity, 
we assume that Gy is square so U;(jw) is a unitary matrix. At frequencies 
where feedback is effective we have S$ = (I+ L)~! = L~1, and (3.53) yields 
Lmin = GKmin © GaU,'. In conclusion, the controller and loop shape with 
the minimum gain will often look like 


Kmin © G'GaU, Lin & GaU (3.54) 


where U = Uy is some all-pass transfer function matrix. This provides a 
generalization of |Kmin| & |G~'Ga| which was derived in (2.59) for SISO 
systems, and the conclusions following (2.59) on page 52 therefore also apply 
to MIMO systems. For example, we see that for disturbances entering at 
the plant inputs, Ga = G, we get Kmin = U, so a simple constant unit 
gain controller yields a good trade-off between output performance and input 
usage. We also note with interest that it is generally not possible to select a 
unitary matrix U such that Dmin = GaU is diagonal, so a decoupling design 
is generally not optimal for disturbance rejection. These insights can be used 
as a basis for a loop-shaping design; see more on H., loop-shaping in Chapter 
9. 


90 MULTIVARIABLE FEEDBACK CONTROL 


3.4.5 Multivariable controller synthesis 


The above design methods are based on a two-step procedure in which we 
first design a pre-compensator (for decoupling control) or we make a pairing 
selection (for decentralized control) and then we design a diagonal controller 
K,(s). Invariably this two-step procedure results in a suboptimal design. 

The alternative is to synthesize directly a multivariable controller K (s) 
based on minimizing some objective function (norm). We here use the word 
synthesize rather than design to stress that this is a more formalized approach. 
Optimization in controller design become prominent in the 1960’s with 
“optimal control theory” based on minimizing the expected value of the output 
variance in the face of stochastic disturbances. Later, other approaches and 
norms were introduced, such as H.. optimal control. We next provide a brief 
summary of the S/K'S and other mixed-sensitivity H.. design methods which 
are used in later examples. 


3.4.6 Summary of mixed-sensitivity H,. design (S/KS) 
In the S/KS problem the objective is to minimize the H.. norm of 


Wes 


as ie 


(3.55) 
This problem was discussed for SISO systems earlier, and it is recommended 
to read Section 2.7.3 carefully. A sample MATLAB file is provided in Example 
2.11, page 64. 

The following issues should be considered when selecting the the weights 
Wp and W,: 


1. KS is the transfer function from r to u in Figure 3.7, so for a system which 
has been scaled as in Section 1.4, a reasonable initial choice for the input 
weight is W,, = I. 

2. S is the transfer function from d to e = y —r. A common choice for the 
performance weight is Wp = diag{wp;} with 


_ 8/M; +wh,; 


i », A<i1 3.56 
bee s+wh, Ai a ( ) 


(see also Figure 2.26 on 63). Selecting A; < 1 ensures approximate integral 
action with S(0) = 0. Often we select M; = 2 for all outputs, whereas 
Wz, may be different for each output. A large value of w}, yields a faster 
response for output 7. 

3. To find a reasonable initial choice for the weight Wp, one can first design 
a controller with some other method, plot the magnitude of the resulting 
diagonal elements of S as a function of frequency, and then use a rational 
approximation of 1/|S;;|) for wp;(s). 
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4. For disturbance rejection, we may in some cases want a steeper slope for 
wp;(s) at low frequencies than that given in (3.56), e.g. as illustrated by 
the weight in (2.74). However, it may be better to consider the disturbances 
explicitly by considering the H. norm of 


_ | WeS WeSGa 
ae ees ace S97) 
or equivalently 
WpSW, ‘ 
v= Ee a wih es (3.58) 


where N represents the transfer function from [4 to the weighted outputs 
Fea In some situations we may want to adjust Wp or Gq in order 
to better satisfy our original objectives. The helicopter case study in 
Section 12.2 illustrates this by introducing a scalar parameter @ to adjust 
the magnitude of Gg. 

5. T is the transfer function from —n to y. To reduce sensitivity to noise and 
uncertainty, we want T small at high frequencies, that is, we may want 
additional roll-off in L. This can be achieved in several ways. One approach 
is to add WrT to the stack for N in (3.55), where Wr = diag{w7;} and 
|w7;| is smaller than 1 at low frequencies and large at high frequencies. 
A more direct approach is to add high-frequency dynamics, W1(s), to the 
plant model to ensure that the resulting shaped plant, G; = GW, rolls off 
with the desired slope. We then obtain an H..-optimal controller K, for 
this shaped plant, and finally include W,(s) in the controller, K = W,Ks. 


More details about H. design are given in Chapter 9. 


3.5 Introduction to multivariable RHP-zeros 


By means of an example, we now give the reader an appreciation of the fact 
that MIMO systems also have zeros even though their presence may not be 
obvious from the elements of G(s). As for SISO systems, we find that RHP- 
zeros impose fundamental limitations on control. 

Zeros of MIMO systems are defined as the values s = z where G(s) loses 
rank, see Section 4.5, and we can find the direction of the zero by looking 
at the direction in which the matrix G(z) has zero gain. For square systems 
we essentially have that the poles and zeros of G(s) are the poles and zeros 
of det G(s). However, this crude method may fail in some cases, as it may 
incorrectly cancel poles and zeros at the same location but with different 
directions. 
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Example 3.8 Consider the following plant 


G(s) = 


1 


(0.28 + 1)(s +1) 


| 


1 


1 


1+2s 2 


| 


(3.59) 


The responses to a step in each individual input are shown in Figure 3.9 (a) and 
(b). We see that the plant is interactive, but for these two inputs there is no inverse 


2 
15 Vy \ye 
| y2 os! \ 
| k 
1 —- ——=| ii 
} 
¥1 o 
0.5 0.5 
0 -l —— 
0 5 10 0 5 10 0 5 10 
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(a) Step in wi, u 


[1 oj”. 


(b) Step in uz, u 


fo 1”. 


(c) Combined step in u1 


and u2,u=[1 


aL: 


Figure 3.9: Open-loop response for G(s) in (3.59) 


response to indicate the presence of a RHP-zero. Nevertheless, the plant does have a 
multivariable RHP-zero at z = 0.5, that is, G(s) loses rank at s =0.5. The singular 
value decomposition of G(0.5) is 


A 
1 , h 0.45 0.89 1.92 0 0.71 —0.71 
Clays 1.65 E | =, Fe aye 0 | Ee 0.71 i300) 
e——— Sr —9™—S eee 
U x VA 


and we have as expected o(G(0.5)) = 


corresponding to the RHP-zero are v = 


0. The input and output directions 


—0.71 
0.71 


and u 


| 


0.89 
—0.45 


l: Thus, the 


RHP-zero is associated with both inputs and with both outputs. The presence of the 
multivariable RHP-zero is also observed from the time response in Figure 3.9 (c), 
which is for a simultaneous input change in opposite directions, u=[1 ae We 
see that y2 displays an inverse response whereas y; happens to remain at zero for 
this particular input change. 

To see how the RHP-zero affects the closed-loop response, we design a controller 
which minimizes the Hoo norm of the weighted S/KS matriz 


_ | Wes 
v= [88] 0 
with weights (see (3.56)) 
wei 0 8/Mi +wpi -4 
v=, = 3 4 = ; A,j=1 62 
W, Wp 0 iA WP 3+ wy, Ai 0 (3 6 ) 
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The MATLAB file for the design is the same as in Table 2.8 on 65, except that we 
now have a 2x 2 system. Since there is a RHP-zero at z= 0.5 we expect that this 
will somehow limit the bandwidth of the closed-loop system. 

Design 1. We weight the two outputs equally and select 


Design 1: My = M2 =1.5; wp, =wp = 2/2 = 0.25 


This yields an Ho. norm for N of 2.80 and the resulting singular values of S are 
shown by the solid lines in Figure 3.10 (a). The closed-loop response to a reference 
change r =[1 AAG is shown by the solid lines in Figure 3.10 (b). We note that 
both outputs behave rather poorly and both display an inverse response. 


2 Design 1: —— 
© Design 2: - - - 
a) 

5 
= 
i] 
&0 
a 
= 
Po Design 1: —— 
xo Design 2: - - - 
2 2 
10” 10° 10° 0 1 2 3 4 
Frequency [rad/s] Time [sec] 
Singul 1 fs (b) Response to change in refer- 
(a) Singular values of S. ener (ie SIE: 


Figure 3.10: Alternative designs for 2 x 2 plant (3.59) with RHP-zero. 


Design 2. For MIMO plants, one can often move most of the deteriorating effect 
(e.g., inverse response) of a RHP-zero to a particular output channel. To illustrate 
this, we change the weight wp2 so that more emphasis is placed on output 2. We do 
this by increasing the bandwidth requirement in output channel 2 by a factor of 100: 


Design 2: M, = M2. =1.5; we, = 0.25, we = 25 


This yields an optimal design with an Ho norm of 2.92. In this case we see from the 
dashed line in Figure 3.10 (b) that the response for output 2 (y2) is excellent with 
no inverse response. However, this comes at the expense of output 1 (yi) where the 
response is somewhat poorer than for Design 1. 

Design 3. We can also interchange the weights wp; and wp2 to stress output 1 
rather than output 2. In this case (not shown) we get an excellent response in output 
1 with no inverse response, but output 2 responds very poorly (much poorer than 
output 1 for Design 2). Furthermore, the Hoo norm is 6.73, whereas it was only 2.92 
for Design 2. 

Thus, we see that it is easier in this example to get tight control of output 2 
than of output 1. This may be expected from the output direction of the RHP-zero, 


“= evel which is mostly in the direction of output 1. We will discuss this in 


more detail in Section 6.5.1. 
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Remark 1 We find from this example that we can direct the effect of the RHP-zero 
to either of the two outputs. This is typical of multivariable RHP-zeros, but there 
are cases where the RHP-zero is associated with a particular output channel and 
it is not possible to move its effect to another channel. The zero is then called a 
“pinned zero” (see Section 4.6.2). 


Remark 2 It is observed from the plot of the singular values in Figure 3.10 (a), 
that we were able to obtain with Design 2 a very large improvement in the “good” 
direction (corresponding to o(.S)) at the expense of only a minor deterioration in the 
“bad” direction (corresponding to a(S)). Thus Design 1 demonstrates a shortcoming 
of the H.. norm: only the worst direction (maximum singular value) contributes to 
the Hoo norm and it may not always be easy to get a good trade-off between the 
various directions. 


3.6 Condition number and RGA 


Two measures which are used to quantify the degree of directionality and the 
level of (two-way) interactions in MIMO systems, are the condition number 
and the relative gain array (RGA), respectively. We define the two measures 
and present an overview of their practical use. We do not prove each result, but 
refer to other places in the book where the details can be found. Some algebraic 
properties of the condition number and the RGA are given in Appendix A.4.1. 


3.6.1 Condition number 


We define the condition number of a matrix as the ratio between the maximum 
and minimum singular values 


(G) = a(G)/a(G) (3.63) 


A matrix with a large condition number is said to be ill-conditioned. For a 
nonsingular (square) matrix a(G) = 1/a(G~'), so y(G) = a(G)a(G"). It 
then follows from (A.117) that the condition number is large if both G and 
G—" have large elements. 

The condition number depends strongly on the scaling of the inputs and 
outputs. To be more specific, if D, and Dz are diagonal scaling matrices, 
then the condition numbers of the matrices G and D;GDz2 may be arbitrarily 
far apart. In general, the matrix G should be scaled on physical grounds, for 
example, by dividing each input and output by its largest expected or desired 
value as discussed in Section 1.4. 

One might also consider minimizing the condition number over all possible 
scalings. This results in the minimized or optimal condition number which is 
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defined by 
*(G) = min y(DiGDz2) (3.64) 
Di,D2 
and can be computed using (A.72). 

The condition number has been used as an input-output controllability 
measure, and in particular it has been assumed that a large condition number 
indicates sensitivity to uncertainty. This is not true in general, but the reverse 
holds: 


If the condition number is small, then multivariable effects of uncertainty is 
not likely to be a problem (see (6.72)). 


If the condition number is large (say, larger than 10), then this may indicate 
control problems: 


1. A large condition number 7(G) = a(G)/a(G) may be caused by a small 
value of o(G), which is generally undesirable (on the other hand a large 
value of g(G) need not necessarily be a problem). 

2. A large condition number may mean that the plant has a large minimized 
condition number, or equivalently, it has large RGA-elements, which 
indicates fundamental control problems, see below. 


Remark. A large condition number does imply that the system is sensitive to 
“unstructured” (full-block) input uncertainty with an inverse-based controller, 
(see (8.135)), but this kind of uncertainty may not occur in practice. We can 
therefore not conclude that a plant with a large condition number is sensitive to 
uncertainty, see also the example plant in . 


3.6.2 Relative Gain Array (RGA 


The Relative Gain Array (RGA) of a nonsingular square matrix G' is a square 
matrix defined as 


RGA(G) = A(G) £4 x (G/F (3.65) 


where x denotes element-by-element multiplication (the Schur product). For 
a 2x 2 matrix with elements g;; the RGA is 


Au = (3.66) 


Ls 912921 
911922 


A »12 M1 1— Au 
A(G) = = : 
(@) ie | E —Au Au |’ 


The RGA has a number of interesting algebraic properties, of which the most 
important are (see Appendix A.4 for more details): 


1. It is independent of input and output scaling. 
2. Its rows and columns sum to one. 
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3. The sum-norm of the RGA, |lAllsum, is very close to the minimized 
condition number 7*, see (A.77). This means that plants with large RGA- 
elements are always ill-conditioned (with a large value of y(G)), but the 
reverse may not hold (i.e., a plant with a large 7(G) may have small RGA- 
elements). 

4. A relative change in an element of G equal to the negative inverse of its 
corresponding RGA-element yields singularity. 

5. The RGA is the identity matrix if G is upper or lower triangular. 


From the last property it follows that the RGA (or more precisely A — I) 
provides a measure of two-way interaction. 

The definition of the RGA may be generalized to non-square matrices by 
using the pseudo inverse, see Appendix A.4.2. 

Bristol (1966), originally introduced the RGA as a steady-state measure 
of interactions for decentralized control. Unfortunately, based on the original 
definition, many people have dismissed the RGA as being “only meaningful at 
w = 0”. To the contrary, in most cases it is the value of the RGA at frequencies 
close to crossover which is most important. 

In addition to the algebraic properties listed above, the RGA has a 
surprising number of useful control properties: 


1. The RGA is a good indicator of sensitivity to uncertainty: 


(a) Uncertain in the input channels (diagonal input uncertainty). Plants 
with large RG A-elements around the crossover-frequency are fundamen- 
tally difficult to control because of sensitivity to this uncertainty, e.g., 
caused by uncertain or neglected actuator dynamics. In particular, de- 
couplers or other inverse-based controllers should not be used for such 
plants (see page 260). 

(b) Element uncertainty. As stated by algebraic property no. 4 above, large 
RGA-elements imply sensitivity to element-by-element uncertainty. 
However, this source of uncertainty may not occur due to physical 
couplings between the transfer function elements, so the diagonal input 
uncertainty mentioned above (which always is present) is usually of more 
concern. 


2. RGA and RHP-zeros. If the sign of an RGA-element changes from s = 0 
to s = oo, then there is a RHP-zero somewhere (see Theorem 10.7). 

3. Non-square plants. Extra inputs: If the sum of the elements in a column of 
RGA are small (< 1), then one may consider deleting the corresponding 
input. Extra outputs: If all elements in a row of RGA are small (< 1),then 
the corresponding output cannot be controlled. (See Section 10.4.) 

4. Diagonal dominance. The RGA can be used to measure diagonal 
dominance, by the simple quantity 


RGA-number = ||A(G) — T||sum (3.67) 
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For decentralized control we prefer pairings for which the RGA-number at 
crossover frequencies is close to 1 (see pairing rule 1 on page 463). Similarly, 
for certain multivariable design methods, it is simpler to choose the weights 
and shape the plant if we first rearrange the inputs and outputs to make 
the plant diagonally dominant with a small RGA-number. 

5. RGA and decentralized control. 


(a) Integrity: For stable plants avoid pairing on negative steady-state RGA- 
elements. Otherwise, if the sub-controllers are designed independently 
each with integral action, the interactions will cause instability either 
when all of the loops are closed, or when the loop corresponding to the 
negative relative gain becomes inactive (e.g. because of saturation) (see 
Theorem 10.6). Interestingly, this is the only use of the RGA directly 
related to its original definition. 

(b) Stability: Prefer pairings corresponding to an RGA-number close to 0 at 
crossover frequencies (see page 463). 


Remark. An iterative evaluation of the RGA, A?(G) = A(A(G)) ete., has in 
applications proved to be useful for choosing pairings for large systems. Wolff (1994) 
found numerically that (with the exception of “borderline” cases) 


Awe Jim AKG) (3.68) 


is a permuted identity matrix (the result is proved for a positive definite Hermitian 
matrix G by Johnson and Shapiro (1986). Typically, A* approaches A® for k 
between 4 and 8. This permuted identity matrix may then be used as a candidate 


ae : eed 22 _ [0.33 0.67] y2 _ 
pairing choice. For example, for G = a | we get A = Pee Boal = 
—0.33 1.33 3 _ |—0.07 1.07 4 _ |0.00 1.00 i si 
1.33 veal ,xe = 1.07 07 and A* = Bee ae which indicates 


that the off-diagonal pairing should be considered. Note that 
Lambda® may sometimes “recommend” a pairing on negative RGA-elements, even 
if a positive pairing exists. 


Example 3.9 Consider a diagonal plant 


100 0 * 

eel t]. A) = 1, 0G) = HB = 29 — 100, 7¢@) =1 

Here the condition number is large which means that the plant gain depends strongly 
on the input direction. However, since the plant is diagonal there are no interactions 
so A(G) = I and y*(G) = 1, and no sensitivity to uncertainty (or other control 
problems) is normally expected. 


Remark. An exception would be if there was uncertainty caused by unmodelled or 
neglected off-diagonal elements in G. This would couple the high-gain and low-gain 
directions, and the large condition number indicates sensitivity to this off-diagonal 
(“unstructured”) uncertainty. 
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Example 3.10 Consider the triangular matrix 


12 2 1 -2 2.41 . 

c=) i],e7= [4 P) MG) =4 MG) = FF = 588, 7G) =1 
(3.69) 

Note that for a triangular matrix the RGA is always the identity matriz and y*(G) 


is always 1. 


Example 3.11 Consider again the distillation process for which we have at steady- 
state 


_ | 87.8 —86.4 -1_ |0.399 —0.315 _ | 85.1 —34.1 
aes Ee Snel ne ee ee ey GIS Bw 35.1 


(3.70) 
In this case y(G) = 197.2/1.391 = 141.7 is only slightly larger than y*(G) = 138.268. 
The magnitude sum of the elements in the RGA-matriz is ||A||sum = 138.275. This 
confirms (A.78) which states that, for 2 x 2 systems, ||A(G)||sum © y~(G) when 
y"(G) is large. The condition number is large, but since the minimum singular 
value o(G) = 1.391 is larger than 1 this does not by itself imply a control problem. 
However, the RGA-elements (and y~(G)) are large, which indicate control problems; 
and fundamental control problems are expected if an analysis shows that G(jw) has 
large RGA-elements also in the crossover frequency range. (Indeed, the idealized 
dynamic model (3.77) used below has large RGA-elements at all frequencies, and we 
will confirm in simulations a strong sensitivity to input channel uncertainty with an 
inverse-based controller ). 


Example 3.12 Consider a3 x 3 plant for which we have 


16.8 30.5 4.30 1.50 0.99 ~—1.48 
-16.7 31.0 1.41 0.41 0.97 0.45 (3.71) 
1.27 54.1 5.40 —0.08 0.95 2.03 


G= , A(G) = 


and y = 69.6/1.63 = 42.6 and y* = 7.80. The magnitude sum of the elements in the 

RGA is ||Allsum = 8.86 which is close to y* as expected from (A.77). Note that rows 
and the columns of A sum to 1. Since a(G) is larger than 1 and the RGA-elements 
are relatively small, this steady-state analysis does not indicate any particular control 
problems for the plant. 


Remark. (3.71) represents the steady-state model of a fluid catalytic cracking (FCC) 
process. A dynamic model of the FCC process in (3.71) is given in Exercise 6.17. 


For a more detailed analysis of achievable performance of the plant (input- 
output controllability analysis), one must also also consider the singular 
values, RGA and condition number as a function of frequency. In particular, 
the crossover frequency range is important. In addition, disturbances and the 
presence of unstable (RHP) plant poles and zeros must be considered. All 
these issues are discussed in much more detail in Chapters 5 and 6 where we 
discuss achievable performance and input-output controllability analysis for 
SISO and MIMO plants, respectively. 
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3.7 Introduction to robustness 


To motivate the need for a deeper understanding of robustness, we present 
two examples which illustrate that MIMO systems can display a sensitivity to 
uncertainty not found in SISO systems. We focus our attention on diagonal 
input uncertainty, which is present in any real system and often limits 
achievable performance because it enters between the controller and the plant. 


3.7.1. Motivating robustness example no. 1: Spinning 
Satellite 


Consider the following plant (Doyle, 1986; Packard, Doyle and Balas, 1993) 
which can itself be motivated by considering the angular velocity control of a 
satellite spinning about one of its principal axes: 


1 s—a?— a(s+1) 
s2+a2|—a(st+1) s-—a? 


G(s) = ; a=10 (3.72) 


A minimal, state-space realization, G = C(sI — A)-1B + D, is 


0 a/1 0 
s [A | By} _}-a 0j0 1 
c4lot3|- 1 aj0O 0 ers) 
—a 1]/0 0 
The plant has a pair of jw-axis poles at s = +ja so it needs to be stabilized. 
Let us apply negative feedback and try the simple diagonal controller 
K=!I 
We get 
TRyHCrien Ss) 1% (3.74) 
7 ~~ s+1|-a 1 . 
The closed-loop system has two poles at s = —1 and so it is stable. This can 


be verified by evaluating the closed-loop state matrix 


0 a 1 a —-1 0 
Ag=A-BKC =| ° ‘lee Pel a 


(To derive A, use ¢ = Ax + Bu,y = Cx and u=-—K-y). 

First let us consider performance. The singular values of LD = GK =G are 
shown in Figure 3.6 (a). We see that o(L) = 1 at low frequencies and starts 
dropping off at about w = 10. Since o(L) never exceeds 1 this implies that we 
do not have tight control in the low-gain direction for this plant (recall the 
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discussion following (3.48)). For example, at steady-state o(T) = 10.05 and 
a(S) = 10. This is also seen from the large off-diagonal elements in T(s) which 
indicate strong interactions in the closed-loop system and poor performance. 
(For reference tracking, however, this may be counteracted by use of a two 
degrees-of-freedom controller.) 

Now let us consider stability robustness. In order to determine stability 
margins with respect to perturbations in each input channel, one may consider 
Figure 3.11 where we have broken the loop at the first input. The loop transfer 
function at this point (the transfer function from w, to 21) is [1(s) = 1/s 


(which is consistent with ti1(s) = 2. = 4& 


as Ter ). This corresponds to an 
1(s) 

infinite gain margin and a phase margin of 90°. On breaking the loop at the 

second input we get the same result. This suggests good robustness properties 

irrespective of the value of a. However, the design is far from robust as a further 


analysis shows. Consider input gain uncertainty, and let «; and e denote the 


ZY Wi 


Figure 3.11: Checking stability margins “one-loop-at-a-time” . 
relative error in the gain in each input channel. Then 
uy = (1 aa €1)t1, Us = (1 + €) Ug (3.75) 


where wi, and us are the actual changes in the manipulated inputs, while u1 
and ug are the desired changes as computed by the controller. It is important 
to stress that this diagonal input uncertainty, which stems from our inability 
to know the exact values of the manipulated inputs, is always present. In 
terms of a state-space description, (3.75) may be represented by replacing B 


by 
po 1+e6 0 
B =| 0 oad 
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The corresponding closed-loop state matrix is 


a 1 = 0 Qa) 14+4 0 1 a 
w= A-BKC= [9 al | 0 eal | 


which has a characteristic polynomial given by 


det(sI — Al,) =s? + (2+te.te)stlt+ateat(@+ijeae (3.76) 
as 


The perturbed system is stable if and only if both the coefficients ag and a, 
are positive. We therefore see that the system is always stable if we consider 
uncertainty in only one channel at a time (at least as long as the channel gain 
is positive). More precisely, we have stability for (—1 < e, < o,€. = 0) and 
(e, = 0,—1 < & < oo). This confirms the infinite gain margin seen earlier. 
However, the system can only tolerate small simultaneous changes in the two 
channels. For example, let €; = —e2, then the system is unstable (a9 < 0) for 

Je| > ! 0.1 

€1 ea aU. 
In summary, we have found that checking single-loop margins is inadequate 
for MIMO problems. We have also observed that large values of a(T) or a(S) 
indicate robustness problems. We will return to this in Chapter 8, where we 
show that with input uncertainty, |e;| < 1/a(T) guarantees robust stability 
(even for “full-block complex perturbations”). 

In the next example we find that we may have sensitivity to diagonal input 
uncertainty also in cases when o(T’) and a(S) have no large peaks. This can 
not happen for a diagonal controller, see (6.77), but it will happen if we use 
an inverse-based controlled for a plant with large RGA-elements, see (6.78). 


3.7.2. Motivating robustness example no. 2: Distillation 
Process 


The following is an idealized dynamic model of a distillation column 


1 87.8 —86.4 ] 


G(s) = F547 | 108.2 —109.6 (3.77) 


The physics of this example was discussed in Example 3.7. This is admittedly 
a very crude model of a distillation column; there should be a high-order lag 
in the transfer function from input 1 to output 2 to represent the liquid flow 
down to the column, and higher-order composition dynamics should also be 
included. Nevertheless, the model is simple and displays important features 
of distillation column behaviour. The plant is ill-conditioned with condition 
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number 7(G) = 141.7 at all frequencies. The plant is also strongly two-way 
interactive and the RGA-matrix at all frequencies is 


(3.78) 


RGA(G) = 35.1 al 


—34.1 35.1 


The large elements in this matrix indicate that this process is fundamentally 
difficult to control (see section A.4). It should be noted that with a more 
detailed model as just discussed, the RGA-elements would approach 1 at 
frequencies around 1 rad/min, indicating less of a control problem. 


2.57 ia Nee Nominal plant: .—— | 
ob Vs Perturbed plant: --- | 
1 as 
f % SR 
L VOSS 4 
1.5 ; x ae 
i : \ Sa oe e Yl 
by \) See 
' \ 
0.571 7S 7 
1 ae Y2 
0 34, = 
0 10 20 30 40 50 60 


Time [min] 


Figure 3.12: Response with decoupling controller to filtered reference input r; = 
1/(5s + 1). The perturbed plant has 20% gain uncertainty as given by (3.81). 


We consider the following inverse-based controller, which may also be looked 
upon as a steady-state decoupler with a PI controller: 


ee Bt G-1(¢) _ k(14+ 75s) [0.3994 —0.3149 


3 = Sg 0.3043 —0.3200|7 #1 =? (3-79) 
We have GKin,y = KinyG = O77, With no model error this controller should 
counteract all the interactions in the plant and give rise to two decoupled first- 
order responses each with time constant 1/0.7 = 1.43 min. This is confirmed 
by the solid line in Figure 3.12 which shows the simulated response to a 
reference change in y;. The responses are clearly acceptable, and we conclude 
that nominal performance (NP) is achieved with the decoupling controller. 

The resulting sensitivity and complementary sensitivity functions with this 
controller are 


8 1 


= he. PSS =F 
s+0.7° ee Cre (2:80) 


S= Sy 


Thus, a(S) and a(T) are both less than 1 at all frequencies, so there are no 
peaks indicating robustness problems. We also find that this controller gives 
an infinite gain margin (GM) and a phase margin (PM) of 90° in each channel. 
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Thus, use of the traditional margins and the peak values of S$ and T indicate 
no robustness problems. However, from the large RGA-elements there is cause 
for concern, and this is confirmed in the following. 

We consider again the input gain uncertainty (3.75) as in the previous 
example, and we select €«; = 0.2 and eg = —0.2. We then have 


uy =1.2u1, us =0.8u2 (3.81) 


Note that the uncertainty is on the change in the inputs (flow rates), and not 
on their absolute values. A 20% error is typical for process control applications 
(see Remark 2 on page 318). The uncertainty in (3.81) does not by itself yield 
instability. This is verified by computing the closed-loop poles, which assuming 
no cancellations are solutions to det (I+ L(s)) = det(I+L7(s)) = 0, see (4.103) 
and (A.11). In our case 
1 n 1 +e 0 0.7 1 + ey 0 
Li(s) = KinvG =KinG| 0 tre|= =| 0 eal 


so the perturbed closed-loop poles are 
sy, = —0.7(1 + 1), §2 => —0.7(1 + €2) (3.82) 


and we have closed-loop stability as long as the input gains 1+ e, and 1+«€ 
remain positive, so we can have up to 100% error in each input channel. We 
thus conclude that we have robust stability (RS) with respect to input gain 
errors for the decoupling controller. 

For SISO systems we generally have that nominal performance (NP) and 
robust stability (RS) imply robust performance (RP), but this is not the case 
for MIMO systems. This is clearly seen from the dotted lines in Figure 3.12 
which show the closed-loop response of the perturbed system. It differs 
drastically from the nominal response represented by the solid line, and even 
though it is stable, the response is clearly not acceptable; it is no longer 
decoupled, and y(t) and yo(t) reach a value of about 2.5 before settling at 
their desired values of 1 and 0. Thus RP is not achieved for the decoupling 
controller. 


Remark 1 There is a simple reason for the observed poor response to the reference 
change in y;. To accomplish this change, which occurs mostly in the direction 
corresponding to the low plant gain, the inverse-based controller generates relatively 
large inputs u; and ue, while trying to keep u; — u2 very small. However, the input 
uncertainty makes this impossible — the result is an undesired large change in the 
actual value of u', — ul, which subsequently results in large changes in y; and y2 
because of the large plant gain (¢(G) = 197.2) in this direction, as seen from (3.44). 


Remark 2 The system remains stable for gain uncertainty up to 100% because 
the uncertainty occurs only at one side of the plant (at the input). If we also 
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consider uncertainty at the output then we find that the decoupling controller yields 
instability for relatively small errors in the input and output gains. This is illustrated 
in Exercise 3.9 below. 


Remark 3 It is also difficult to get a robust controller with other standard design 
techniques for this model. For example, an S/K S-design as in (3.55) with Wp = wpl 
(using M = 2 and wg = 0.05 in the performance weight) and W, = I, yields a 
good nominal response (it is not decoupled, but the inputs are smaller than for the 
decoupling controller and remain less than 1 in magnitude), but the system is very 
sensitive to input uncertainty, and the outputs go up to about 3.4 and settle very 
slowly when there is 20% input gain error. 


Remark 4 Attempts to make the inverse-based controller robust using the second 
step of the 7... loop-shaping procedure are also unhelpful, see Exercise 3.7 This 
shows that robustness with respect to coprime factor uncertainty does not necessarily 
imply robustness with respect to input uncertainty. The solution is to avoid inverse- 
based controllers for a plant with large RGA-elements, also see 260. 


Exercise 3.7 Consider again the distillation process G(s) in (8.77). The response 
using the inverse-based controller Kiny in (8.79) was found to be sensitive to input 
gain errors. We want to see if controller can be modified to yield a more robust system 
by using the McFarlane-Glover Hoo. loop-shaping procedure. To this effect, let the 
shaped plant be Gs = G'Kinv, i.e. Wi = Kinv, and design an Hoo controller Ks for the 
shaped plant (see page 406 and Chapter 9), such that the overall controller becomes 
K = KinvKs. (You will find that ymin = 1.414 which indicates good robustness with 
respect to coprime factor uncertainty, but the loop shape is almost unchanged and 
the system remains sensitive to to input uncertainty.) 


Exercise 3.8 Design a SVD-controller K =W1K,W2 for the distillation process 
in (3.77), i.e. select W1 = V and Wz = UT where U and V are given in (3.44). 
Select Ks in the form 
co, T5841 0 
K.=|"" 0 C5 T5841 


and try the following values: 


(a) cy =c2 = 0.005; 
(b) c1 =0.005, co = 0.05; 
(c) c1 =0.7/197 = 0.0036, ¢2 = 0.7/1.39 = 0.504. 


Simulate the closed-loop reference response with and without uncertainty. Designs (a) 
and (b) should be robust. Which has the best performance? Design (c) should give 
the response in Figure 3.12. In the simulations, include high-order plant dynamics 
by replacing G(s) by CR PEES G(s). What is the condition number of the controller 


in the three cases? Discuss the results. (See also the conclusion on page 260). 


Exercise 3.9 Consider again the distillation process (3.77) with the decoupling 
controller, but also include output gain uncertainty €;. That is, let the perturbed loop 
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transfer function be 


1 _pAle. _ 0.7 144 0 1l+e 0 -1 
D(s) =G@ Kin = 3 0 me | 0 il G (3.83) 
SS 


Lo 


where Lo has no dynamics for the distillation example. The closed-loop poles of 
the perturbed system are solutions to det(I + L'(s)) = det(I + (ki/s)Lo) = 0, or 
equivalently 


det( =I + Lo) = (s/k1)? + tr(Lo)(s/k:) + det(Lo) = 0 (3.84) 


For ki > 0 we see that instability occurs if and only if the trace and/or the 
determinant of Lo are negative. Since det(Lo) > 0 for any gain error less than 
100%, instability can only occur if tr(Lo) < 0. Evaluate tr(Lo) and show that with 
gain errors of equal magnitude the combination of errors which most easily yields 
instability is with a €2 an €2 e. Use this to show that the perturbed 


system is unstable if 
1 
EY Ninel eee) 


where \11 = g11922/detG is the 1,1-element of the RGA of G. In our case 
Ai1 = 35.1 and we get instability for « > 0.120. Check this numerically. 


Remark. The instability condition in (3.85) for simultaneous input and output 
gain uncertainty, applies to the very special case of a 2 x 2 plant, in which all 
elements share the same dynamics, G(s) = g(s)Go, and an inverse-based controller, 


K(s) = (ki/s)G7'(s). 


3.7.3. Robustness conclusions 


From the two motivating examples above we found that multivariable plants 
can display a sensitivity to uncertainty (in this case input uncertainty) which 
is fundamentally different from what is possible in SISO systems. 

In the first example (spinning satellite), we had excellent stability margins 
(PM and GM) when considering one loop at a time, but small simultaneous 
input gain errors gave instability. This could be expected from the peak values 
(Hoo norms) for S and T, defined as 


|Tllac = maxa(T(jw)), — [\Slloo = maxa(S(jw)) (3.86) 


which were both large (about 10) for this example. 

In the second example (distillation process), we again had excellent stability 
margins (PM and GM), and the system was also robustly stable to errors (even 
simultaneous) of up to 100% in the input gains. However, in this case small 
input gain errors gave terrible output performance, so robust performance 
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was not satisfied, and adding simultaneous output gain uncertainty resulted 
in instability (see Exercise 3.9). These problems with the decoupling controller 
could be expected because the plant has large RGA-elements, see (6.78). For 
this example the H. norms of S and T were both about 1, so the absence of 
a peak in S and T does not guarantee robustness. 

Although sensitivity peaks, RGA-elements, etc. are useful indicators of 
robustness problems, they provide no exact answer to whether a given source 
of uncertainty will yield instability or poor performance. This motivates the 
need for better tools for analyzing the effects of model uncertainty. We want to 
avoid a trial-and-error procedure based on checking stability and performance 
for a large number of candidate plants. This is very time consuming, and in 
the end one does not know whether those plants are the limiting ones. What 
is desired, is a simple tool which is able to identify the worst-case plant. This 
will be the focus of Chapters 7 and 8 where we show how to represent model 
uncertainty in the H,., framework, and introduce the structured singular value 
pi as our tool. The two motivating examples are studied in more detail in 
Example 8.7 and Section 8.11.3 where a j-analysis predicts the robustness 
problems found above. 


3.8 General control problem formulation 


(weighted) (weighted) 
exogenous inputs exogenous outputs 
w z 


v 
control signals sensed outputs 


Figure 3.13: General control configuration for the case with no model uncertainty 


In this section we consider a general method of formulating control problems 
introduced by Doyle (1983; 1984) The formulation makes use of the general 
control configuration in Figure 3.13, where P is the generalized plant and K 
is the generalized controller as explained in Table 1.1 on page 13. Note that 
positive feedback is used. 

The overall control objective is to minimize some norm of the transfer 
function from w to z, for example, the H. norm. The controller design 
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problem is then: 


e Find acontroller K which based on the information in v, generates an input 
u which counteracts the influence of w on z, thereby minimizing the norm 
from w to z. 


The configuration in Figure 3.13 may at first glance seem restrictive. 
However, this is not the case, and we will demonstrate the generality of the 
setup with a few examples, including the design of observers (the estimation 
problem) and feedforward controllers. The most important point of this 
section is to appreciate that almost any linear control problem can be 
formulated using the block diagram in Figure 3.13 (for the nominal case) 
or in Figure 3.20 (with model uncertainty). 


Remark. We may generalize the control configuration still further by including 
diagnostics as additional outputs from the controller giving the 4-parameter 
controller introduced by Nett (1986), but this is not considered in this book. 


3.8.1 Obtaining the generalized plant P 


The routines in MATLAB for synthesizing H.. and H2 optimal controllers 
assume that the problem is in the general form of Figure 3.13, that is, they 
assume that P is given. To derive P (and K) for a specific case we must first 
find a block diagram representation and identify the signals w, z, u and v. To 
construct P one should note that it is an open-loop system and remember to 
break all “loops” entering and exiting the controller AK. Some examples are 
given below and further examples are given in Section 9.3 (Figures 9.9, 9.10, 
9.11 and 9.12). 


Example 3.13 One degree-of-freedom feedback control configuration. 
We want to find P for the conventional one degree-of-freedom control configuration 
in Figure 8.14. The first step is to identify the signals for the generalized plant: 


Figure 3.14: One degree-of-freedom control configuration. 
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Figure 3.15: Equivalent representation of Figure 3.14 where the error signal to be 
minimized is z = y — r and the input to the controller is v = r — ym. 


Wi d 
w=l]wol=|[rl|; z=e=y-7r, v=r—-—Yym=r-y-n (3.87) 


With this choice of v, the controller only has information about the deviation r—Yym. 
Also note that z = y—r, which means that performance is specified in terms of the 
actual output y and not in terms of the measured output Ym. The block diagram in 
Figure 3.14 then yields 


z y-r=Gut+d—r=Iw, —Iw.+0uw3 + Gu 


v = T-Ym=r—-Gu-d-n Iw, + Iw2 — Iw3 — Gu 


and P which represents the transfer function matriz from[w uJ" to[z v]" is 


I -I 0 G 
BS 2h lo 2 Ce) 


Note that P does not contain the controller. Alternatively, P can be obtained by 
inspection from the representation in Figure 3.165. 


Remark. Obtaining the generalized plant P may seem tedious. However, when 
performing numerical calculations P can be generated using software. For example, 
in MATLAB we may use the simulink program, or we may use the sysic program 
in the y-toolbox. The code in Table 3.1 generates the generalized plant P in (3.88) 
for Figure 3.14. 
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Table 3.1: MATLAB program to generate P in (3.88). 
% Uses the Mu-toolbox 


systemnames = ’G’; % G is the SISO plant. 
inputvar = ’[d(1);r(1);n(1);u(1)]’; ¥, Consists of vectors w and u. 
input_toG = ’[u]’; 

outputvar = >[G+d-r; r-G-d-n]’; 7, Consists of vectors z and v. 
sysoutname = ’P’; 

sysic; 


3.8.2 Controller design: Including weights in P 


To get a meaningful controller synthesis problem, for example, in terms of 
the Ho or He norms, we generally have to include weights W, and W, in 
the generalized plant P. That is, we consider the weighted or normalized 
exogenous inputs w (where # = W,,w consists of the “physical” signals 
entering the system; disturbances, references and noise), and the weighted or 
normalized controlled outputs z = W.Z (where Z often consists of the control 
error y — r and the manipulated input u), and The weighting matrices are 
usually frequency dependent and typically selected such that weighted signals 
w and z are of magnitude 1, that is, such that the norm from w to z should be 
less than 1. Thus, in most cases only the magnitude of the weights matter, and 
we may without loss of generality assume that W,,(s) and W,(s) are stable 
and minimum phase (they need not even be rational transfer functions but if 
not they will be unsuitable for controller synthesis using current software). 


Example 3.14 Stacked S/T/KS problem. Consider an Hoo problem where we 
want to bound a(S) (for performance), o(T) (for robusiness and to avoid sensitivity 
to noise) and a(S) (to penalize large inputs). These requirements may be combined 
into a stacked Ho. problem 


W.ikS 
WrT 


min ||N(K)|lo, N= 
m Wes 


(3.89) 


where K is a stabilizing controller. In other words, we have z = Nw and the objective 
is to minimize the Hoo norm from w to z. Except for some negative signs which have 
no effect when evaluating ||N||.0, the N in (3.89) may be represented by the block 
diagram in Figure 3.16 (convince yourself that this is true). Here w represents a 
reference command (w = —r, where the negative sign does not really matter) or 
a disturbance entering at the output (w = dy,)), and z consists of the weighted 
input 21 = Wuu, the weighted output z2 = Wry, and the weighted control error 
z3 =Wp(y—r). To find P we must also “pull out” the controller, that is, we must 
break the loop before and after K in Figure 3.16. We get the following set of equations 
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z 
Figure 3.16: Block diagram corresponding to z = Nw in (3.89). 
z2 = WrGu 
23 = Wpwt+WrpGu 
v = -w-Gu 
so the generalized plant P from[w ul" to[z v]" is 
0 Wil 
- 0 WrG 
Pee Wel We (3.90) 
-—[ -G 
3.8.3 Partitioning the generalized plant P 
We often partition P as 
Pui Pis 
P= 3.91 
pa on 


such that its parts are compatible with the signals w, z, u and v in the 
generalized control configuration, 


a = Pyyw ale Pyou (3.92) 
Pow a Pou (3.93) 


U 


The reader should become familiar with this notation. For the above example, 
we get 
0 Wit 
Pu= 0 », Po=|WrG (3.94) 
Wpl WpG 
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Px =—I, Px2=—-G (3.95) 


Note that P22 has dimensions compatible with the controller, i.e., if K is an 
Ny X Ny matrix, then Py. is an n, X nN, matrix. For cases with one degree-of- 
freedom negative feedback control we have Po. = —G. 


3.8.4 Analysis: Closing the loop to get N 


i 


Figure 3.17: General block diagram for analysis with no uncertainty. 


The general feedback configuration in Figure 3.13 with the generalized plant 
P, has the controller K as a separate block. This configuration is useful when 
synthesizing the controller. However, for analysis of closed-loop performance 
the controller is given, and we may absorb K into the interconnection structure 
and obtain the system N as shown in Figure 3.17 where 


z=Nw (3.96) 


To find N, first partition the generalized plant P as given in (3.91)-(3.93), 
combine this with the controller equation 


u= Kv (3.97) 


and eliminate u and v from equations (3.92), (3.93) and (3.97) to yield z = Nw 
where WN is given by 


N= Pa 4 Pek Pok) ORs HRP ® (3.98) 


Here F,(P,K) denotes a lower Linear Fractional Transformation (LFT) of 
K around P. To assist in remembering the sequence of Piz and P>,, notice 
that the first (last) index in P,; is the same as the first (last) index in 
Py2K (I — Px2K)~1 P21. Some properties of LFTs are given in Appendix A.7. 

In words, N is obtained from Figure 3.13 by using K to close a lower 
feedback loop around P. Since positive feedback is used in the general 
configuration in Figure 3.13 the term (I — Po2K)~* has a negative sign. The 
lower LFT in (3.98) is also represented by the block diagram in Figure 3.2. 
The reader is advised to become comfortable with the above manipulations 
before progressing much further. 
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Example 3.15 We want to derive N for the partitioned P in (8.94) and (8.95) 
using the LFT-formula in (3.98). We get 


0 Wl —W.uKS 
N=] 0 |4+|WrG| KI+GK)\(-D =| —Wrt 
Wel WeG Wes 


where we have made use of the identities GKS = T and I—T = S. With exception of 
the two negative signs this is identical to N given in (3.89). Of course, the negative 
signs have no effect on the norm of N. 


Again, it should be noted that deriving N from P is much simpler using 
available software. For example in the MATLAB p-Toolbox we can evaluate 
N = Fi(P,K) using the command N=starp(P,K) . Here starp denotes the 
matrix star product which generalizes the use of LFTs (see Appendix A.7.5). 


Exercise 3.10 Consider the two degrees-of-freedom feedback configuration in 
Figure 1.3(b). (i) Find P when 


d yr r 
v=[>] “=| i le are (3.99) 


(i) Let z = Nw and derive N in two different ways; directly from the block diagram 
and using N = Fi(P,K). 


3.8.5 Generalized plant P: Further examples 


To illustrate the generality of the configuration in Figure 3.13, we now present 
two further examples: one in which we derive P for a problem involving 
feedforward control, and one for a problem involving estimation. 


Example 3.16 Consider the control system in Figure 3.18, where y is the output 
we want to control, yo is the secondary output (extra measurement), and we also 
measure the disturbance d. By secondary we mean that y2 is of secondary importance 
for control, that is, there is no control objective associated with it. The control 
configuration includes a two degrees-of-freedom controller, a feedforward controller 
and a local feedback controller based on the extra measurement y2. To recast this into 
our standard configuration of Figure 3.18 we define 


Tr 

d y 
=|°); z=y-n v= 3.100 
(3.100) 

d 


Note that d andr are both inputs and outputs to P and we have assumed a perfect 
measurement of the disturbance d. Since the controller has explicit information about 
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d 


Figure 3.18: System with feedforward, local feedback and two degrees-of-freedom 
control. 


r we have a two degrees-of-freedom controller. The generalized controller K may be 
written in terms of the individual controller blocks in Figure 8.18 as follows: 


K=[KiKy —Ki —K2 Ka] (3.101) 


By writing down the equations or by inspection from Figure 3.18 we get 


G: -I GiGs 
0 © oO 
P=|G: 0 GiGs (3.102) 
0 0 Gs 
I 0 0 


Then partitioning P as in (3.92) and (3.93) yields Pox =[0 GiG2 Gs O]’. 


Exercise 3.11 Cascade implementation Consider Example 3.16. The local 
feedback based on y2 is often implemented in a cascade manner, see also Figure 10.4. 
In this case the output from Ky enters into Kz and it may be viewed as a reference 
signal for yo. Derive the generalized controller K and the generalized plant P in this 
case. 


Remark. From the above exercises we see that a cascade implementation does 
not usually limit the achievable performance since, unless the optimal Ke or Ky 
have RHP-zeros, we can obtain from the optimal K the subcontrollers K, and 
Ky, (we may have to add a small D-term to K to make the controllers proper). 
However, if we impose restrictions on the design such that, for example K2 or Ky 
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are designed “locally” (without considering the whole problem), then this will limit 
the achievable performance. For example, for a two degrees-of-freedom controller 
a common approach is to first design the feedback controller K1 (or Ky) for 
disturbance rejection (without considering reference tracking) and then design K, 
for reference tracking. This will generally give some performance loss compared to 
a simultaneous design of K, and K;,. 


Example 3.17 Output estimator. Consider a situation where we have no 
measurement of the output y which we want to control. However, we do have a 
measurement of another output variable y2. Let d denote the unknown external inputs 
(including noise and disturbances) and ua the known plant inputs (a subscript q@ is 
used because in this case the output u from K is not the plant input). Let the model 
be 
y=Guet+Gad yo = Fue Fad 

The objective is to design an estimator, Kest, such that the estimated output 


~~ 


y= ext 2] is as close as possible in some sense to the true output y; see 


Figure 3.19. This problem may be written in the general framework of Figure 3.13 
with 


_|d a = y2 
ieee ee a 
Note that u = ¥, that is, the output u from the generalized controller is the estimate 
of the plant output. Furthermore, K = Kest and 


Ga G -I 
P=|Fi F 0 (3.103) 
0 J 0 


We see that Po2 = [3 since the estimator problem does not involve feedback. 


Exercise 3.12 State estimator. In the Kalman filter problem the objective is to 
minimize x —& (whereas in Example 3.17 the objective was to minimize y — y). 
Show how the Kalman Filter problem can be represented in the general configuration 
of Figure 3.18 and find P. 


3.8.6 Deriving P from N 
For cases where N is given and we wish to find a P such that 
N = F(P,K) = Pi t+ Pi2K (I — Pook)! Px 


it is usually best to work from a block diagram representation. This was 
illustrated above for the stacked N in (3.89). Alternatively, the following 
procedure may be useful: 


1. Set K =0 in N to obtain Py. 
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Figure 3.19: Output estimation problem. One particular estimator Kest is a 
Kalman Filter. 


2. Define A = N — P,, and rewrite A such that each term has a common 
factor B = K(I — Pog K)~" (this gives Po2). 
3. Since A = Pig BP 1, we can now usually obtain Pjz and Ps; by inspection. 


Example 3.18 Weighted sensitivity. We will use the above procedure to derive 

P when N =wpS =wp(I+GK)~' where wp is a scalar weight. 

1. Py = N(K =0)=wpl. 

2. A= N—-—wpl = we(S —1) = —weT = —weGK(I+GK)~', and we have 
B=K(I+GK)~' so Px2 = —-G. 

8 A=—wpGB so we have Piz = —wpG and Px = 1, and we get 


_ |wel —wpG 
P=| a. eae (3.104) 


Remark. When obtaining P from a given N, we have that P,; and P22 are unique, 
whereas from Step 3 in the above procedure we see that Piz and Poi_are not unique. 
For instance, let a be a real scalar then we may instead choose Piz = aPi2 and 
Py = (1/a)P21. For P in (3.104) this means that we may move the negative sign 
or the scalar wp from Pi2 to P21. 
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Exercise 3.13 Mixed sensitivity. Use the above procedure to derive the 
generalized plant P for the stacked N in (3.89). 


3.8.7 Problems not covered by the general formulation 


The above examples have demonstrated the generality of the control 
configuration in Figure 3.13. Nevertheless, there are some controller design 
problems which are not covered. Let N be some closed-loop transfer function 
whose norm we want to minimize. To use the general form we must first obtain 
a P such that N = F;(P, K). However, this is not always possible, since there 
may not exist a block diagram representation for N. As a simple example, 
consider the stacked transfer function 


(I+ a] 


(4 KO) (3.105) 


“| 


The transfer function ([ + GK)~! may be represented on a block diagram 
with the input and output signals after the plant, whereas (I + KG)! 
may be represented by another block diagram with input and output signals 
before the plant. However, in N there are no cross coupling terms between 
an input before the plant and an output after the plant (corresponding to 
G(I + KG)~*), or between an input after the plant and an output before the 
plant (corresponding to —K(I+GK)~1) so N cannot be represented in block 
diagram form. Equivalently, if we apply the procedure in Section 3.8.6 to N 
in (3.105), we are not able to find solutions to Pjz and Ps; in Step 3. 

Another stacked transfer function which cannot in general be represented 
in block diagram form is 


N= Bal (3.106) 


Remark. The case where N cannot be written as an LFT of K, is a special case of 
the Hadamard-weighted H.. problem studied by van Diggelen and Glover (1994a). 
Although the solution to this 7(.. problem remains intractable, van Diggelen and 
Glover (1994b) present a solution for a similar problem where the Frobenius norm 
is used instead of the singular value to “sum up the channels”. 


Exercise 3.14 Show that N in (3.106) can be represented in block diagram form 
if Wp = wpl where wy is a scalar. 


3.8.8 A general control configuration including model 
uncertainty 


The general control configuration in Figure 3.13 may be extended to include 
model uncertainty as shown by the block diagram in Figure 3.20. Here the 


Figure 3.22: Rearranging a system with multiple perturbations into the NA- 
structure. 
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3.9 Additional exercises 


Most of these exercises are based on material presented in Appendix A. The 
exercises illustrate material which the reader should know before reading the 
subsequent chapters. 


Exercise 3.15 Consider the performance specification ||wpS|loo. Suggest a 
rational transfer function weight wp(s) and sketch it as a function of frequency 
and suggest a rational transfer function wp(s) for the following two cases: 


1. We desire no steady-state offset, a bandwidth better than 1 rad/s and a resonance 
peak (worst amplification caused by feedback) lower than 1.5. 

2. We desire less than 1% steady-state offset, less than 10% error up to frequency 
3 rad/s, a bandwidth better than 10 rad/s, and a resonance peak lower than 2. 
Hint: See (2.73) and (2.74). 


Exercise 3.16 By ||M/||.. one can mean either a spatial or temporal norm. Explain 
the difference between the two and illustrate by computing the appropriate infinity 
norm for 


3.«4 


s—1 3 
m=[% §]) se 


s+1s42 


Exercise 3.17 What is the relationship between the RGA-matriz and uncertainty 
in the individual elements? Illustrate this for perturbations in the 1,1-element of the 


matrix 
10 9 
A= 9 | (3.109) 


Exercise 3.18 Assume that A is non-singular. (i) Formulate a condition in terms 
of singular value of E for the matriz A+ E to remain non-singular. Apply this to A 
in (3.109) and (ii) find an E of minimum magnitude which makes A+ E singular. 


Exercise 3.19 Compute ||Alli1, (A) = |lAlliz, |JAllico, [|Allz, ||Allmax and 
\|Al|sum for the following matrices and tabulate your results: 


1 0 de Lh) 1 0 
won nel hor[t ome ola 


Show using the above matrices that the following bounds are tight (i.e. we may have 
equality) for 2 x 2 matrices (m = 2): 


(A) < ||Allr < vm a(A) 
I|Allmax < (A) < m[Al|max 
I|Allan/Wm < @(A) < VmllAlla 


I|Allico/ Wm < @(A) < Vml|A}lice 
|All < ||Allsum 
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Exercise 3.20 Find example matrices to illustrate that the above bounds are also 
tight when A is a square m xX m matriz with m > 2. 


Exercise 3.21 Do the singular values bound the magnitude of the elements of a 
matriz? That is, is (A) larger or equal to the largest element (in magnitude), and 
is a(A) smaller or equal to the smallest element? For a non-singular matrix, how is 
a(A) related to the largest element in A~'? 


Exercise 3.22 Consider a lower triangular mxm matrix A with aig = —1, aij = 1 
for alli > j, and ai; =0 for alli <j. 

a) What is det A ? 

b) What are the eigenvalues of A ? 

c) Show that the smallest singular value is less than or equal to 2~™. 

d) What is the RGA of A? 

Let m = 4 and find an E with the smallest value of (FE) such that A+ E is 
singular. 


is-) 


Exercise 3.23 Find two matrices A and B such that p(A +B) > p(A) + p(B) 
which proves that the spectral radius does not satisfy the triangle inequality and is 
thus not a norm. 


Exercise 3.24 Write T= GK(I+GK)~' as an LFT of K, i.e. find P such that 
T = F(P,K). 


Exercise 3.25 Write K as an LFT of T=GK(I+GK) |, i.e. find J such that 
K = FJ,T). 


Exercise 3.26 State-space descriptions may be represented as LFTs. To 
demonstrate this find H for 


F,(H,1/s) = C(sf — A)"'B+D 


Exercise 3.27 Show that the set of all stabilizing controllers in (4.94) can be 
written as K = Fi(J,Q) and find J. 


Exercise 3.28 In (3.11) we stated that the sensitivity of a perturbed plant, S' = 
(1+ G'K)~', is related to that of the nominal plant, S =(I+GK)~* by 


S' =S(I+ EoT)"' 


where Eo = (G' — G)G™'. This exercise deals with how the above result may 
be derived in a systematic (though cumbersome) manner using LFTs (see also 
(Skogestad and Morari, 1988a)). 

a) First find F such that S' = (I+ G'K)~' = Fi(F,K), and find J such that 
K =F(J,T) (see Exercise 3.25). 

b) Combine these LFTs to find S' = F,(N,T). What is N in terms of G and G'?. 
Note that since Ji; =0 we have from (A.155) 


Fis FioSi2 


N= 
Joi Fo, Jog + Joi Fo2Ji2 
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c) Evaluate S' = Fi(N,T) and show that 
S=1-@G@'TU-U-G@a"')r) 


d) Finally, show that this may be rewritten as S' = S(I+ EoT)'. 


A 


ELEMENTS OF LINEAR 
SYSTEM ‘THEORY 


The main objective of this chapter is to summarize important results from linear 
system theory which will be required latex in the book. The treatment is thorough, 
but readers are encouraged to consult other books, such as Kailath (1980) or Zhou, 
Doyle and Glover (1996), for more details and background information if these results 
are new to them. 


4.1 System descriptions 


The most important property of a linear system (operator) is that it satisfies 
the superposition principle.. Let f(u) be a linear operator and uw, and wz two 
independent variables then 


f(ui + ua) = f(ur) + f (ue) (4.1) 


We use in this book various representations of time-invariant linear systems, 
all of which are equivalent for systems that can be described by linear ordinary 
differential equations with constant coefficients and which do not involve 
differentiation of the inputs (independent variables). The most important of 
these representations are discussed in this section. 


4.1.1 State-space representation 


Consider a system with m inputs (vector u) and / outputs (vector y) which 
has an internal description of n states (vector x). A natural way to represent 
many physical systems is by nonlinear state-space models of the form 


t= f(r,u); y = g(z, u) (4.2) 


where & = dx/dt. Linear state-space models may then be derived from the 
linearization of such models. They provide a convenient means of describing 


124 MULTIVARIABLE FEEDBACK CONTROL 


the dynamic behaviour of proper, rational, linear systems. In terms of 
deviation variables (where x represents a deviation from some nominal value 
or trajectory, etc.) we have 


é(t) = Ax(t) + Bu(t) (4.3) 


y(t) = Cx(t) + Du(t) (4.4) 


where A, B, C and D are real matrices. If (4.3) is derived by linearizing (4.2) 
then A = Of/Ox and B = Of /Ou (see Section 1.5 for an example of such a 
derivation). A is sometimes called the state matrix. These equations may be 


rewritten as 
| |A Ble 
y| |C Di fu 


which gives rise to the short-hand notation 


as Fare (4.5) 


which is frequently used to describe a state-space model of a system G. 
Note that the representation in (4.3-4.4) is not a unique description of the 
input-output behaviour of a linear system. First, there exist realizations with 
the same input-output behaviour, but with additional unobservable and/or 
uncontrollable states (modes). Second, even for a minimal realization (a 
realization with the fewest number of states and consequently no unobservable 
or uncontrollable modes) there are an infinite number of possibilities. To see 
this let S be an invertible constant matrix, and introduce the new states 
q = Sz, ie., © = S~'g. Then an equivalent state-space realization (i.e. 
one with the same input-output behaviour) in terms of these new states 
(“coordinates” ) is 


Aya SAS; By S30, =CS Dp, =D (4.6) 


The most common realizations are given by a few canonical forms, such as 
the Jordan (diagonalized) canonical form , the observability canonical form, 
etc. 

Given the linear dynamical system in (4.3) with an initial condition x(to) 
and an input u(t), the dynamical system response x(t) for t > to can be 
determined from 


t 
a(t) = eA) e(to) + / eAC—7) Bu(r)dr (4.7) 
to 


where the matrix exponential is e44 = I + 7%. ,(At)*/k!. The output is 
then given by y(t) = Ca(t) + Du(t). For a diagonalized realization (where 
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SAS! = A is a diagonal matrix) we have that e4’ = diag{e*(4)'}, where 
A;(A) is the 7’th eigenvalue of A. 
For a system with disturbances d and measurement noise n the state-space- 
model is written as 
£= Ar+ Bu+ Ed (4.8) 


y=Ca+Du+Fd+n (4.9) 


4.1.2 Impulse response representation 


The impulse response matrix is 


0 t<0 
a= { CeAtB+D6(t) t>0 (210) 


where 6(t) is the unit impulse function which satisfies lim,9 f; 6(t)dt = 1. 
The ij’th element of the impulse response matrix, gjj(t), represents the 
response y;(¢) to an impulse u,;(t) = d(¢) for a system with a zero initial 
state. 

The dynamic response to an arbitrary input u(t) with initial conditions 
«(0) = 0 may then, from (4.7), be written as 


y(t) = g(t) x u(t) = i g(t — T)u(r)dr (4.11) 


where * denotes the convolution operator. 


4.1.3 Transfer function representation - Laplace 
transforms 


The transfer function representation is unique and is very useful for directly 
obtaining insight into the properties of a system. It is defined as the Laplace 
transform of the impulse response 


G(s) = i g(t)e ** dt (4.12) 


Alternatively, we may start from the state-space description. With the 
assumption of zero initial conditions, x(t = 0) = 0, the Laplace transforms of 
(4.3) and (4.4) become? 


sa(s) = Ar(s)+ Bu(s) = 2(s) = (sf — A)~'Bu(s) (4.13) 


T We make the usual abuse of notation and let f(s) (rather than e.g. f(s)) denote the 
Laplace transform of f(t). 
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y(s) = Cx(s) + Du(s) > y(s)=(C(sI—A)7'B+D)u(s) (4.14) 
———————_—_—_ — 
G(s) 
where G(s) is the transfer function matrix. Equivalently, 


1 


G(s) = mero oo — A)B + Ddet(sI — A)| (4.15) 


where adj(/) denotes the adjugate (or classical adjoint) of MM which is the 
transpose of the matrix of cofactors of M. From the Appendix 


n n 


det (sf — A) = [J ds(s — A) = J] (s — (A) (4.16) 


i=1 i=1 


When disturbances are treated separately, see (4.8) and (4.9), the 
corresponding disturbance transfer function is 


Ga(s) =C(sI — A)"'E4+F (4.17) 


Note that any system written in the state-space form of (4.3) and (4.4) has a 
transfer function, but the opposite is not true. For example, time delays and 
improper systems can be represented by Laplace transforms, but do not have a 
state-space representation. On the other hand, the state-space representation 
yields an internal description of the system which may be useful if the model 
is derived from physical principles. It is also more suitable for numerical 
calculations. 


4.1.4 Frequency response 


An important advantage of transfer functions is that the frequency response 
(Fourier transform) is directly obtained from the Laplace transform by setting 
s = jw in G(s). For more details on the frequency response the reader is 
referred to Sections 2.1 and 3.3. 


4.1.5 Coprime factorization 


Another useful way of representing systems is the coprime factorization which 
may be used both in state-space and transfer function form. In the latter case 
a right coprime factorization of G' is 


G(s) = N,(s)Mz2(s) (4.18) 


where N,(s) and M,(s) are stable coprime transfer functions. The stability 
implies that N,(s) should contain all the RHP-zeros of G(s) and M;(s) should 
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contain as RHP-zeros all the RHP-poles of G(s). The coprimeness implies that 
there should be no common RHP-zeros in N, and M,. which result in pole-zero 
cancellations when forming N,.M,-!. Mathematically, coprimeness means that 
there exist stable U,(s) and V,(s) such that the following Bezout identity is 
satisfied 

U,Nr+V,M, =I (4.19) 


Similarly, a left coprime factorization of G is 
G(s) = M,"(s)Ni(s) (4.20) 


Here N; and M, are stable and coprime, that is, there exist stable U;(s) and 
Vi(s) such that the following Bezout identity is satisfied 


NU, + MV, =1 (4.21) 


For a scalar system, the left and right coprime factorizations are identical, 
G=NM1=M—!N. 


Remark. Two stable scalar transfer functions, N(s) and M(s), are coprime if and 
only if they have no common RHP-zeros. In this case we can always find stable U 
and V such that NU+ MV =1. 


Example 4.1 Consider the scalar system 


_ (s—1)(s+ 2) 
G(s) = (s—3)(s +4) (4.22) 


To obtain a coprime factorization, we first make all the RHP-poles of G zeros of M, 
and all the RHP-zeros of G zeros of N. We then allocate the poles of N and M so 
that N and M are both proper and the identity G = NM~' holds. Thus 


s—1 go38 


N(s) = 4, Ms) = 5 


is a coprime factorization. Usually, we select N and M to have the same poles as 
each other and the same order as G(s). This gives the most degrees of freedom subject 
to having a realization of [N M]" with the lowest order. We then have that 

(s — 1)(s + 2) (s —3)(s +4) 


N(s) =k", M(s)=k 
(s) ee ar ee 


= 4.2 
stkistke : ( 3) 


is a coprime factorization for any k and for any ki, k2 > 0. 


From the above example, we see that the coprime factorization is not unique. 
Now introduce the operator M* defined as M*(s) = M7(—s) (which for 
s = jw is the same as the complex conjugate transpose M7? = M7). Then 


G(s) = N,(s)M,-1(s) is called a normalized right coprime factorization if 


M*M, + N#N, =1 (4.24) 
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In this case X,(s) = [M, N,]’ satisfies X*X, = I and is called an 
inner transfer function. The normalized left coprime factorization G(s) = 
M, '(s)Ni(s) is defined similarly, requiring that 


MM? +N,Nf=1 (4.25) 


In this case X)(s) = [M) Ni]" is co-inner which means XX/ = I. 
The normalized coprime factorizations are unique to within a right (left) 
multiplication by a unitary matrix. 


Exercise 4.1 We want to find the normalized coprime factorization for the scalar 
system in (4.22). Let N and M be as given in (4.23). and substitute them into 
(4.24). Show that after some algebra and comparing of terms one obtains: k = +0.71, 
ky = 5.67 and ko = 8.6. 


To derive normalized coprime factorizations by hand, as in the above 
exercise, is in general difficult. Numerically, however, one can easily find a 
state space realization. If G has a minimal state-space realization 


a4 Lepr 


then a minimal state-space realization of a normalized left coprime 
factorization is given (Vidyasagar, 1985) by 


A+HC B+HD A 
[ Ni(s) Mi(s) i | RI2C | RD RP 


where 
H2-(BD'+z07)R", R2I+DpDT 


and the matrix Z is the unique positive definite solution to the algebraic 
Riccati equation 


(A— BS-'D?C)Z + Z(A— BS-'DTC)? — ZCTR''CZ+ BS ‘Bt =0 
(4.26) 
where 
S214D7D. 
Notice that the formulae simplify considerable for a strictly proper plant, i.e. 


when D = 0. The MATLAB commands in Table 4.1 find the normalized 
coprime factorization for G(s) in (4.22). 


Exercise 4.2 Verify numerically (e.g., using the MATLAB file in Table 4.1 or the 
-toolboz command sncfbal) that the normalized coprime factors of G(s) in (4.22) 
are as given in Exercise 4.1. 
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Table 4.1: MATLAB commands to generate normalized 
coprime factorization. 


%, Uses the Mu toolbox 


G=zp2sys([1 -2],[3 -4]); % G(s) in (4.22) 
[a,b,c,d]=unpck(G) ; 

h 

% Find Normalized Coprime factors of system [a,b,c,d]. 
h 


S=eye(size(d’*d) )+d’ *d; 
R=eye(size(d*d’))+d*d’; 


Al = a-b*inv(S)*d’*c; 
R1 = c’*inv(R)*c; 
Qi = b*inv(S)*b’; 


[z1,z2,fail,reig_min] = ric_schr([A1’ -R1; -Q1 -A1]); Z = z2/z1; 
¥, Alternative: aresolv in Robust control toolbox: 

% [z1,z2,eig,zerr,zwellposed,Z] = aresolv(A1’,Q1,R1); 

H = -(b*d’? + Z*c’)*inv(R); 

A =a + Hc; 

Bn = b + Hxd; Bm = 4H; 

C = inv(sqrt(R))*c; 

Dn = inv(sqrt(R))*d; Dm = inv(sqrt(R)); 

pek(A,Bn,C,Dn); 

pek(A,Bm,C,Dm) ; 


4.1.6 More on state-space realizations 


Inversion. In some cases we may want to find a state-space description of 
the inverse of a system. For a square G(s) we have 


eta, = A-—BD7'C | BD 
G(s) = —p-"G (2a 


(4.27) 


where D is assumed to be nonsingular. For a non-square G(s) in which D has 
full row (or column) rank, a right (or left) inverse of G(s) can be found by 
replacing D~! by Dt, the pseudo-inverse of D. 

For a strictly proper system with D = 0, one may obtain an approximate 
inverse by including a small additional feed-through term D, preferably chosen 
on physical grounds. One should be careful, however, to select the signs of the 
terms in D such that one does not introduce RHP-zeros in G(s) because this 
will make G(s)~+ unstable. 

Improper systems. Improper transfer functions, where the order of the 
s-polynomial in the numerator exceeds that of the denominator, cannot be 
represented in standard state-space form. To approximate improper systems 
by state-space models, we can include some high-frequency dynamics which 
we know from physical considerations will have little significance. 

Realization of SISO transfer functions. Transfer functions are a 
good way of representing systems because they give more immediate insight 
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into a systems behaviour. However, for numerical calculations a state-space 
realization is usually desired. One way of obtaining a state-space realization 
from a SISO transfer function is given next. Consider a strictly proper transfer 
function (D = 0) of the form 


Bn—18" "1 +--+ +18 + Bo 
G(s) = 
8” + An—18™ 1 +-++- +415 +49 
and the relationship y(s) = G(s)u(s). Then, since multiplication by s 
corresponds to differentiation in the time domain, (4.28) and the relationship 
correspond to the following differential equation 


(4.28) 


y” (t)+an—1y”*(t)+:--+ary' (t)+aoy(t) = Bn—1u"—* (t) +: --+1u' (t) + Bou(t) 


where y”~1(t) and u”~1(t) represent n — 1’th order derivatives, etc. We can 
further write this as 


y” = (—a@n_1y”* + Bn—1u"*) +--+ + (—ary’ + Bru") + (—aoy + Bor) 


an 


where we have introduced new variables 71, %2,...%, and we have y = 4. 
Note that 2? is the n’th derivative of #,(t). With the notation ¢ = 2’(t) = 
dx/dt, we have the following state-space equations 


En = —Aox1 + Sou 
En1 = —a1%,+ fyut+an 
Ey = —Apn—-1%1 + Bn-1u+ 22 


corresponding to the realization 


—adn_1 1 0: 0 0 Bn—1 
—an-2 0 1 0 0 Bn—2 
sell), So ek Be eel “ee (4.29) 
-a. 0 0 1 0 Bo 
-—a, 00 - 0 1 feat 
-—ag 00 - 0 0 Bo 
C=[1 0 0 -::: 0 0] 


This is sometimes called the observer canonical form. Two advantages of this 
realization are that one can obtain the elements of the matrices directly from 
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the transfer function, and that the output y is simply equal to the first state. 
Notice that if the transfer function is not strictly proper, then we must first 
bring out the constant term, i.e. write G(s) = G,(s) + D, and then find the 
realization of G(s) using (4.29). 


Example 4.2 To obtain the state-space realization, in observer canonical form, 
of the SISO transfer function G(s) = ==", we first bring out a constant term by 


st 
division to get 
s—a —2a 
— = 1 
(8) apd s+a ry 
Thus D =1. For the term =22 we get from (4.28) that Bo = —2a and ao =a, and 


sta 
therefore (4.29) yields A = —a, B = —2a and C = 1. 


Example 4.3 Consider an ideal PID-controller 


2 
Ke eae ee) (4.30) 


TIS TIS 


Since this involves differentiation of the input, it is an improper transfer function and 

cannot be written in state-space form. A proper PID controller may be obtained by 

letting the derivative action be effective over a limited frequency range. For example 
1 TDS 


K(s) = K-(14 t 4.31 
(s) ( Trs 1+erps (e8h) 


where ¢€ is typically 0.1 or less. This can now be realized in state-space form in an 
infinite number of ways. Four common forms are given below. In all cases, the D- 
matrix, which represents the controller gain at high frequencies (s > co), is a scalar 
given by 

l+e 


D= Ke (4.32) 
1. Diagonalized form (Jordan canonical form) 
_ 10 0 _ Ke/tr £, 7 
ae[? 2]. 22[ tity) cet 1 ess 
2. Observability canonical form 
0 1 
Ant sail) pale C=[1 0] (4.34) 
=D 
1 1 Ke 
where i Ke(— aa) 2 aGe (4.35) 
8. Controllability canonical form 
0 0 1 
ae a E ane C=I[n 7] (4.36) 
=D 


where y; and y2 are as given above. 
4. Observer canonical form in (4.29) 
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foes eee tay! 8 O16] (4.37) 
0 0 ’ Bo ’ 
2 a 
wie aes fy = KK. (4.38) 
€TITD E€°TITD 


On comparing these four realizations with the transfer function model in 
(4.31), it is clear that the transfer function offers more immediate insight. One 
can at least see that it is a PID controller. 

Time delay. A time delay (or dead time) is an infinite-dimensional system 
and not representable as a rational transfer function. For a state-space 
realization it must therefore be approximated. An n’th order approximation of 
a time delay 0 may be obtained by putting n first-order Padé approximations 


in series ; 
1— ss)" 
—O8 ns Vast) (4.39) 
(1+ 358)” 


Alternative (and possibly better) approximations are in use, but the above 
approximation is often preferred because of its simplicity. 


4.2 State controllability and state 
observability 

Definition 4.1 State controllability The dynamical system & = Ax + Bu 

or the pair (A,B) is said to be state controllable if, for any initial state 


x(0) = %o, any time ty > 0 and any final state x1, there exists an input u(t) 
such that x(t;) = 2. Otherwise the system is said to be state uncontrollable. 


From (4.7) one can verify that a particular input which achieves x(t) = 74 
is 

u(t) = —BTeA” (4-9 Wy, (t,)-1(e4" 29 — 2) (4.40) 

where W,(t) is the Gramian matrix at time t, 

A f° T 
W.(t) = ‘| e4T BBT eA dr (4.41) 
ft) 

There are many ways to check whether a system is state controllable. First, we 
have that (A, B) is state controllable if and only if the controllability matrix 
c4[B AB A?B ... A™1B] (4.42) 


has rank n (full row rank). Here n is the number of states. 


ELEMENTS OF LINEAR SYSTEM THEORY 133 


Example 4.4 Consider a scalar system with two states and the following state- 
space realization 


The transfer function is 


1 


G(s) =C(sl— A)'B= ‘aa 


and a minimal realization has only one state. In fact, the first state is not 
controllable. This is verified since the controllability matriz has two linearly 
dependent rows: 


C=[B AB]= E 4 
Second, from (4.40), we have that the system (A, B) is state controllable 
if and only if the Gramian matrix W,(t) has full rank (and thus is positive 
definite) for any t > 0. For a stable system (A is stable) we only need to 


consider P & W,(oo), that is, the pair (A, B) is state controllable if and only 
if the controllability Gramian 


PS fi eA BBT eA" dr (4.43) 
0 


is positive definite (P > 0) and thus has full rank. P may also be obtained as 
the solution to the Lyapunov equation 


AP +PA™ =~—BBT™ (4.44) 


In words, if a system is state controllable we can by use of its inputs u bring 
it from any initial state to any final state within any given finite time. State 
controllability would therefore seem to be an important property for control, 
but it rarely is for the following four reasons: 


1. It says nothing about how the states behave at intermediate and later 
times, e.g., it does not imply that one can hold (as t + oo) the states at a 
given value. 

2. The required inputs may be very large with sudden changes. 

. Some of the states may be of no practical importance 

4. The definition is an existence result which provides no degree of 
controllability (see Hankel singular values for this). 


eo 


The first two objections are illustrated in the following example. 
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(a) Input trajectory to give desired state at t = 400 s. 
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(b) Response of states (tank temperatures). 


Figure 4.1: State controllability of four first-order systems in series. 


Example 4.5 Controllability of tanks in series 

Consider a system with one input and four states arising from four first-order 
systems in series, G(s) = 1/(rs+1)*. A physical example could be four identical 
tanks (e.g., bath tubs) in series where water flows from one tank to the next. 


Energy balances, assuming no heat loss, yield Ty = nls, T3 = sn, T2 = 
ann, TT = nto where the statesx=[T, To T3 Tel? are the four tank 


temperatures, the input u = To is the inlet temperature, and tT = 100s is the residence 
time in each tank. A state-space realization is 


[are 0 0 0 ] eae 

0.01 —0.01 0 0 0 

aS 0 0.01 —0.01 0 os 0 2) 
0 0 0.01 —0.01 0 


In practice, we know that it is very difficult to control the four temperatures 
independently, since at steady-state all temperatures must be equal. However, the 
controllability matrit C in (4.42) has full rank, so the system is state controllable 
and it must be possible to achieve at any given time any desired temperature in 
each of the four tanks simply by adjusting the inlet temperature. This sounds almost 
too good to be true, so let us consider a specific case. Assume that the system is 
initially at steady-state (all temperatures are zero), and that we want to achieve at 
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t = 400 s the following temperatures: T,(400) = 1, T2(400) = —1, T3(400) = 1 and 
T1(400) = —1. The change in inlet temperature, To(t), to achieve this was computed 
from (4.40) and is shown as a function of time in Figure 4.1 (a). The corresponding 
tank temperatures are shown in Figure 4.1 (b). Two things are worth noting: 


1. The required change in inlet temperature is more than 100 times larger than the 
desired temperature changes in the tanks and it also varies widely with time. 

2. Although the tank temperatures are indeed at their desired values of +1 att = 400s 
they quickly diverge from these values for t > 400s. (Since To is reset to 0 at 
t = 400s, all temperatures will eventually approach 0 as t > co). 


Tt is quite easy to explain the shape of the input To(t): The fourth tank is furthest 
away and we want its temperature to decrease (T4(400) = —1) and therefore the 
inlet temperature To is initially decreased to about —40. Then, since T3(400) = 1 
is positive, To is increased to about 30 at t = 220s; it is subsequently decreased to 
about —40, since T.(400) = —1, and finally increased to more than 100 to achieve 
T; (400) = 1. 


From the above example, we see clearly that the property of state 
controllability may not imply that the system is “controllable” in a practical 
sense 7. This is because state controllability is concerned only with the value 
of the states at discrete values of time (target hitting), while in most practical 
cases we want the outputs to remain close to some desired value (or trajectory) 
for all values of time, and without using inappropriate control signals. 

So now we know that state controllability does not imply that the system is 
controllable from a practical point of view. But what about the reverse: If we 
do not have state controllability, is this an indication that the system is not 
controllable in a practical sense? In other words, should we be concerned if a 
system is not state controllable? In many cases the answer is “no”, since we 
may not be concerned with the behaviour of the uncontrollable states which 
may be outside our system boundary or of no practical importance. If we 
are indeed concerned about these states then they should be included in the 
output set y. State uncontrollability will then appear as a rank deficiency in 
the transfer function matrix G(s) (see functional controllability below). 

So is the issue of state controllability of any value at all? Yes, because it 
tells us whether we have included some states in our model which we have 
no means of affecting. It also tells us when we can save on computer time 
by deleting uncontrollable states which have no effect on the output for zero 
initial conditions. 

In summary, state controllability is a system theoretical concept which 
is important when it comes to computations and realizations. However, its 
name is somewhat misleading, and most of the above discussion might have 
been avoided if only Kalman, who originally defined state controllability, had 
used a different terminology. For example, better terms might have been 


2Tn Chapter 5 we introduce a more practical concept of controllability which we call “input- 
output controllability” 


136 MULTIVARIABLE FEEDBACK CONTROL 


“pointwise controllability” or “state affect-ability” from which it would have 
been understood that although all the states could be individually affected, 
we might not be able to control them independently over a period of time. 


Definition 4.2 State observability The dynamical system & = Ax + Bu, 
y = Cr+ Du (or the pair (A,C)) is said to be state observable if, for any time 
t, > 0, the initial state x(0) = x can be determined from the time history of 
the input u(t) and the output y(t) in the interval [0, ti]. Otherwise the system, 
or (A,C), is said to be state unobservable. 


The system (A, C) is state observable if and only if the observability matrix 


C 
CA 


oe (4.46) 


can} 
has rank n (full column rank). We also have that the system (A,C) is state 
observable if and only if the Gramian matrix 


t 
W.(t) & i eA"CTCeA dr (4.47) 
ft) 
has full rank (and thus is positive definite) for any t > 0. For a stable 
system (A is stable), we need only consider W,(co), where Q = W,(cv) is 
the observability Gramian 


ge | eA" CT Ce dr (4.48) 
0 


Q can also be found as the solution to the following Lyapunov equation 
A7Q4+QA=-C'™C (4.49) 


A system is state observable if we can obtain the value of all individual states 
by measuring the output y(t) over some time period. However, even if a system 
is state observable it may not be observable in a practical sense. For example, 
obtaining x(0) may require taking high-order derivatives of y(t) which may 
be numerically poor and sensitive to noise. This is illustrated in the following 
example. 


Example 4.5 continued. If we define y = Ty (the temperature of the last tank), 
thenC =[0 0 0 1] and we find that the observability matriz O has full column 
rank so all states are observable from y. However, consider a case where the initial 
temperatures in the tanks, T:(0),i =1,...,4, are nonzero (and unknown), and the 
inlet temperature To(t) = u(t) is zero fort >0. Then, from a practical point of view, 
it is clear that it is numerically very difficult to back-calculate, for example T;(0) 
based on measurements of y(t) = Ty(t) over some interval [0, t1], although in theory 
all states are observable from the output. 
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Definition 4.3 Minimal realization, McMillan degree and hidden 
mode _ A state-space realization (A, B,C,D) of G(s) is said to be a minimal 
realization of G(s) if A has the smallest possible dimension (i.e. the fewest 
number of states). The smallest dimension is called the McMillan degree of 
G(s). A mode is hidden if it is not state controllable or observable and thus 
does not appear in the minimal realization. 


Since only controllable and observable states contribute to the input-output 
behaviour from u to y, it follows that a state-space realization is minimal 
if and only if (A,B) is state controllable and (A,C) is state observable. 
Note that, uncontrollable states will contribute to the output response y(t) 
if e(t = 0) F 0, but their effect will die out if the uncontrollable states 
are stable.Unobservable states have no effect on the outputs whatsoever, and 
may be viewed as outside the system boundary, and thus of no interest from 
a control point of view. One possible exception is for unstable hidden modes, 
because we probably want to avoid the system “blowing up”; see remarks 
following Definition 4.5. 


4.3 Stability 


There are a number of possible ways in which stability may be defined, see 
Willems (1970). Fortunately, for linear time-invariant systems these differences 
do not matter, and we use the following definition: 


Definition 4.4 A system is (internally) stable if none of its components 
contain hidden unstable modes and the injection of bounded external signals 
at any place in the system result in bounded output signals measured anywhere 
in the system. 


The word internally is included in the definition to stress that we do not only 
require the response from one particular input to another particular output to 
be stable, but require stability for signals injected or measured at any point 
of the system. This is discussed in more detail for feedback systems in Section 
4.7. Similarly, the components must contain no hidden unstable modes, that 
is, any instability in the components must be contained in their input-output 
behaviour. 


Definition 4.5 State stabilizable, state detectable and hidden 
unstable modes. A system is state stabilizable if all unstable modes are 
state controllable. A system is state detectable if all unstable modes are state 
observable. A system with unstabilizable or undetectable modes is said to 
contain hidden unstable modes. 


138 MULTIVARIABLE FEEDBACK CONTROL 


A linear system with a pair (A, B) is state stabilizable if and only if there 
exists a matrix F' such that A+ BF is stable (Hurwitz, see Theorem 4.7), 
that is, there exists a state feedback u = F'x such that the system is stable. 
If a system is not stabilizable then no such F exists. Similarly, a pair (A, C) 
is state detectable if and only if there exists a matrix D such that A+ LC is 
stable (Hurwitz). If a system is not detectable, then there is a state within 
the system which will eventually grow out of bounds, but we have no way of 
observing this from the outputs y(t). 

Any unstable linear system can be stabilized by feedback control (at 
least in theory) provided the system contains no hidden unstable mode(s). 
Systems with hidden unstable modes must be avoided both in practice and in 
computations (since variables will eventually blow up on our computer if not 
on the factory floor). In the book we always assume, unless otherwise stated, 
that our systems contain no hidden unstable modes. 


4.4. Poles 


For simplicity, we here define the poles of a system in terms of the eigenvalues 
of the state-space A-matrix. More generally, the poles of G(s) may be 
somewhat loosely defined as the finite values s = p where G(p) has a 
singularity (“is infinite”), see also Theorem 4.8 below. 


Definition 4.6 Poles. The poles p; of a system with state-space description 
(4.3)-(4.4) are the eigenvalues \;(A),i =1,...,n of the matriz A. The pole or 
characteristic polynomial ¢(s) is defined as ¢(s) = det (sI—A) = JJj_, (spi). 
Thus the poles are the roots of the characteristic equation 

(8) 2 det(sf — A) =0 (4.50) 


To see that this definition is reasonable, recall (4.16). Note that if A does 
not correspond to a minimal realization then the poles by this definition 
will include the poles (eigenvalues) corresponding to uncontrollable and/or 
unobservable states. 


4.4.1 Poles and stability 


For linear systems, the poles determine stability: 


Theorem 4.7 A linear dynamic system 4 = Ax + Bu is stable if and only if 
all the poles are in the open left half plane (LHP), that is, Re{\;(A)} < 0, Vi. 
A matriz A with such a property is said to be “stable” or Hurwitz. 


Note that systems with poles on the jw-axis, including integrators, are 
defined to be unstable. To illustrate Theorem 4.7 consider a diagonalized 
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realization (Jordan form), from which we see that the time response in (4.7) 
can be written as a sum of terms each containing a mode e%'4)', also see 
Appendix A.2.2. Eigenvalues in the RHP with Re{A;(A)} > 0 give rise to 
unstable modes since in this case e‘(4)t is unbounded as t — oo. Eigenvalues 
in the open LHP give rise to stable modes where e*(4)t > 0 as t > ov. 


4.4.2 Poles from state-space realizations 


Poles are usually obtained numerically by computing the eigenvalues of the A- 
matrix. To get the fewest number of poles we should use a minimal realization 
of the system. 


4.4.3. Poles from transfer functions 


The following theorem from MacFarlane and Karcanias (1976) allows us to 
obtain the poles directly from the transfer function matrix G(s) and is also 
useful for hand calculations. It also has the advantage of yielding only the 
poles corresponding to a minimal realization of the system. 


Theorem 4.8 The pole polynomial $(s) corresponding to a minimal 
realization of a system with transfer function G(s) is the least common 
denominator of all non-identically-zero minors of all orders of G(s). 


A minor of a matrix is the determinant of the matrix obtained by deleting 
certain rows and/or columns of the matrix. We will use the notation M7? to 
denote the minor corresponding to the deletion of rows r and columns c in 
G(s). In the procedure defined by the theorem we cancel common factors 
in the numerator and denominator of each minor. It then follows that only 
observable and controllable poles will appear in the pole polynomial. 
Example 4.6 Consider the plant: G(s) = Gori’ e** which has no state-space 
realization as it contains a delay and is also improper. Thus we can not compute 
the poles from (4.50). However from Theorem 4.8 we have that the denominator is 
(s+1) and as expected G(s) has a pole at s = —1. 


Example 4.7 Consider the square transfer function matriz 


1 s—l1 8 
Gs) = 4.51 

(9) = To5 4 164d) 6 A (4.51) 
The minors of order 1 are the four elements which all have (s + 1)(s + 2) in the 
denominator. The minor of order 2 is the determinant 


(s — 1)(s — 2) +6s 1 


det G(s) = 1.252(s Te 1)2(s Ae 2)2 = 1.252(s + 1)(s + 2) 


(4.52) 
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Note the pole-zero cancellation when evaluating the determinant. The least common 
denominator of all the minors is then 


6(s) = (s +1)(s +2) (4.53) 


50 a minimal realization of the system has two poles: one at s = —1 and one at 
s=-2. 


Example 4.8 Consider the 2 x 3 system, with 3 inputs and 2 outputs, 


G(s) = 1 (s —1)(s +2) 0 (s—1)? 
(s+ 1)(s + 2)(s— 1) (s+1)(s+2) (s—1)(s+1) (s—-1)(s4+1) 
(4.54) 
The minors of order 1 are the five non-zero elements (e.g., M33 = gii1(s)): 
1 s—1 —1 1 1 
4. 
s+1’ (s+1)(s+2)’ s—1’ s+2’ s+2 (4:95) 
The minor of order 2 corresponding to the deletion of column 2 is 
ee (s —1)(s + 2)(s—1)(s +1) + (8 +1)(s+2)(s—1)? _ 2 
: ((s + 1)(s + 2)(s — 1)? (s + 1)(s + 2) 
(4.56 
The other two minors of order two are 
—(s—1) 1 
M, = Ms = 4.57 
‘(st 1(st2)2? “9 (+1842) ( 
By considering all minors we find their least common denominator to be 
$(s) = (8 +1)(s +.2)*(s—1) (4.58 
The system therefore has four poles: one at s = —1, one at s = 1 and two at s = —2. 


From the above examples we see that the MIMO-poles are essentially the 
poles of the elements. However, by looking at only the elements it is not 
possible to determine the multiplicity of the poles. For example, let Go(s) be 


a square 2 x 2 transfer function matrix with no pole at s = —a, and consider 
G(s) = ——Go(s) (4.59) 
i $s ! 
s+a 
How many poles at s = —a does a minimal realization of G(s) have? Note 


that if G(s) is an m x m matrix, then from (A.9), 


= det (Go(s)) (4.60) 


det (G(s)) = det (2 Gos)) = 


Therefore if Go has no zeros at s = —a, then G(s) has m poles at s = —a. 
Thus, the answer is that G(s) in (4.59) may have two poles at s = —a (as 
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for G(s) in (3.77), or T(s) in (3.74)), one pole at s = —a (as in (4.51) where 
det Go(s) has a zero at. s = —a) or no pole at s = —a (if all the elements of 
Go(s) have a zero at s = —a). 

As noted above, the poles are obtained numerically by computing the 
eigenvalues of the A-matrix. Thus, to compute the poles of a transfer function 
G(s), we must first obtain a state-space realization of the system. Preferably 
this should be a minimal realization. For example, if we make individual 
realizations of the five non-zero elements in Example 4.8 and then simply 
combine them to get an overall state space realization, we will get a system 
with 15 states, where each of the three poles (in the common denominator) 
are repeated five times. A model reduction to obtain a minimal realization 
will subsequently yield a system with four poles as given in (4.58). 


4.5 Zeros 


Zeros of a system arise when competing effects, internal to the system, are 
such that the output is zero even when the inputs (and the states) are not 
themselves identically zero. For a SISO system the zeros z; are the solutions 
to G(z) = 0. In general, it can be argued that zeros are values of s at which 
G(s) loses rank (from rank 1 to rank 0 for a SISO system). This is the basis 
for the following definition of zeros for a multivariable system (MacFarlane 
and Karcanias, 1976). 


Definition 4.9 Zeros. z; is a zero of G(s) if the rank of G(z;) is less than 
the normal rank of G(s). The zero polynomial is defined as z(s) = T];2,(s—2:) 
where nz is the number of finite zeros of G(s). 


In this book we do not consider zeros at infinity; we require that z; is finite. 
Recall that the normal rank of G(s) is the rank of G(s) at all values of s 
except at a finite number of singularities (which are the zeros). Note that this 
definition of zeros is based on the transfer function matrix, corresponding 
to a minimal realization of a system. These zeros are sometimes called 
“transmission zeros”, but we will simply call them “zeros”. We may sometimes 
use the term “multivariable zeros” to distinguish them from the zeros of the 
elements of the transfer function matrix. 


4.5.1 Zeros from state-space realizations 


Zeros are usually computed from a state-space description of the system. First 
note that the state-space equations of a system may be written as 


rofs]=[3} rm=["e* 8] wm 
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The zeros are then the values s = z for which the polynomial system matrix, 
P(s), loses rank, resulting in zero output for some nonzero input. Numerically, 
the zeros are found as non-trivial solutions (with u, #0 and x, 4 0) to the 
following problem 


(I, — M) Fal =0 (4.62) 
A B I 0 
u=|é al ial | (4.63) 
This is solved as a generalized eigenvalue problem — in the conventional 


eigenvalue problem we have J, = J. Note that we usually get additional zeros 
if the realization is not minimal. 


4.5.2 Zeros from transfer functions 


The following theorem from MacFarlane and Karcanias (1976) is useful for 
hand calculating the zeros of a transfer function matrix G(s). 


Theorem 4.10 The zero polynomial z(s), corresponding to a minimal 
realization of the system, is the greatest common divisor of all the numerators 
of all order-r minors of G(s), where r is the normal rank of G(s), provided that 
these minors have been adjusted in such a way as to have the pole polynomial 
o(s) as their denominators. 


Example 4.9 Consider the 2 x 2 transfer function matriz 


G(s) = = Pe ae ay (4.64) 


The normal rank of G(s) is 2, and the minor of order 2 is the determinant, 


det G(s) = 2s =2 a . From Theorem 4.8, the pole polynomial is ¢(s) = s+2 
and therefore the zero polynomial is z(s) = s—4. Thus, G(s) has a single RHP-zero 


ats =A. 


This illustrates that in general multivariable zeros have no relationship with 
the zeros of the transfer function elements. This is also shown by the following 
example where the system has no zeros. 


Example 4.7 continued. Consider again the 2 x 2 system in (4.51) where 
det G(s) in (4.52) already has ¢(s) as its denominator. Thus the zero polynomial 
is given by the numerator of (4.52), which is 1, and we find that the system has no 
multivariable zeros. 


The next two examples consider nonsquare systems. 
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Example 4.10 Consider the 1 x 2 system 


s—1 | (4.65) 


G(s) =| 
(s) stl s+2 
The normal rank of G(s) is 1, and since there is no value of s for which both elements 
become zero, G(s) has no zeros. 


In general, non-square systems are less likely to have zeros than square 
systems. For instance, for a square 2 x 2 system to have a zero, there must 
be a value of s for which the two columns in G(s) are linearly dependent. On 
the other hand, for a 2 x 3 system to have a zero, we need all three columns 
in G(s) to be linearly dependent. 

The following is an example of a non-square system which does have a zero. 


Example 4.8 continued.Consider again the 2 x 3 system in (4.54), and 
adjust the minors of order 2 in (4.56) and (4.57) so that their denominators are 
o(s) =(s+1)(s +2)?(s—1). We get 


: _ —(s—1)? 5) = 208 = Ws +2) 3) — SIDE +2) 
Ma(s) =, Mala) Be Mala) = SS (4.66) 
The common factor for these minors is the zero polynomial 
2(s) = (s — 1) (4.67) 


Thus, the system has a single RHP-zero located at s = 1. 

We also see from the last example that a minimal realization of a MIMO 
system can have poles and zeros at the same value of s, provided their 
directions are different. This is discussed next. 


4.6 More on poles and zeros 


4.6.1 Directions of poles and zeros 


In the following let s be a fixed complex scalar and consider G(s) as a 
complex matrix. For example, given a state-space realization, we can evaluate 
G(s) =C(sI— A) 1B+D. 

Zero directions. Let G(s) have a zero at s = z. Then G(s) loses rank at 
$ = 2, and there will exist nonzero vectors uz and y, such that 


G(z)u.=0, y@G(z) =0 (4.68) 


Here wu, is defined as the input zero direction, and y, is defined as the output 
zero direction. We usually normalize the direction vectors to have unit length, 
Le., ||wz|l2 = 1 and |ly.||2 = 1. From a practical point of view the output zero 
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direction, yz, is usually of more interest than uz, because y, gives information 
about which output (or combination of outputs) may be difficult to control. 

In principle, we may obtain u, and y, from an SVD of G(z) = USV", and 
we have that uz is the last column in V (corresponding to the zero singular 
value of G(z)) and y, is the last column of U. An example was given earlier 
in (3.60). A better approach numerically, is to obtain u, from a state-space 
description using the generalized eigenvalue problem in (4.63). Similarly, y, 
may be obtained from the transposed state-space description, using M7 in 
(4.63). 

Pole directions. Let G(s) have a pole at s = p. Then G(p) is infinite, and 
we may somewhat crudely write 


G(p)up = 00, yp G(p) = 00 (4.69) 


where wy is the input pole direction, and y, the output pole direction. As for uz 
and y., the vectors u, and y, may be obtained from an SVD of G(p) = UNV". 
Then uy is the first column in V (corresponding to the infinite singular value), 
and yp the first column in U. If the inverse of G(p) exists then it follows from 
the SVD that 

G"(p)yp =0, uj'G"(p) = 0 (4.70) 


However if we have a state-space realization of G(s), then it is better to 
determine the pole directions from the right and left eigenvectors of A 
(Havre, 1995). Specifically, if p is pole of G(s), then p is an eigenvalue of 
A. Let tp and gp be the corresponding right and left eigenvectors, i.e. 


Aty = pty, g,/ A= pai 
then the pole directions are 
Yp = Cty, Up = B" dp (4.71) 


Example 4.11 Consider the 2x2 plant in (4.64), which has a RHP-zero at z = 4 
and a LHP-pole at p= —2. We will use an SVD of G(z) and G(p) to determine the 
zero and pole directions (but we stress that this is not a reliable method numerically). 
To find the zero direction consider 


H 
1) 3 4 1|0.55 —0.83]]9.01 0] |06 —0.8 
GG) = Gia) = 6 les “| 6 ie 0.55 0 | EE 0.6 


The zero input and output directions are associated with the zero singular value of 


_ [0.80 _ [0.83 
G(z) and we get uz = 0.60 and y, = 0.55 |. We see from yz that the the 


zero has a slightly larger component in the first output. Next, to determine the pole 
directions consider 


Gp+e)=G(24+0)=5 es rere (4.72) 
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The SVD ase — 0 yields 
A 
1 [055 0.83] [9.01 o] [06 —08 
GeO 0.83 or 0 | Ee | 


The pole input and outputs directions are associated with the largest singular value, 
0.60 —0.55 
—0.80 0.83 
the pole has a slightly larger component in the second output. 


o1= 9.01/c, and we get Up = and Yp = |. We note from yp that 


Remark. It is important to note that although the locations of the poles and 
zeros are independent of input and output scalings, their directions are not. Thus, 
the inputs and outputs need to be scaled properly before attempting to make any 
interpretations based on pole and zero directions. 


4.6.2. Remarks on poles and zeros 


1. The zeros resulting from a minimal realization are sometimes called the 
transmission zeros. If one does not have a minimal realization, then numerical 
computations (e.g., using MATLAB) may yield additional invariant zeros. These 
invariant zeros plus the transmission zeros are sometimes called the system zeros. 
The invariant zeros can be further subdivided into input and output decoupling 
zeros. These cancel poles associated with uncontrollable or unobservable states 
and hence have limited practical significance. We recommend that a minimal 
realization is found before computing the zeros. 

2. Rosenbrock (1966; 1970) first defined multivariable zeros using something similar 
to the Smith-McMillan form. Poles and zeros are defined in terms of the McMillan 
form in Zhou et al. (1996). 

3. The presence of zeros implies blocking of certain input signals (MacFarlane and 
Karcanias, 1976). If z is a zero of G(s), then there exists an input signal of the 
form uze*'14(t), where u, is a (complex) vector and 14(¢) is a unit step, and a 
set of initial conditions x,, such that y(t) = 0 for ¢ > 0. 

4. For square systems we essentially have that the poles and zeros of G(s) are the 
poles and zeros of det G(s). However, this crude definition may fail in a few cases. 
For instance, when there is a zero and pole in different parts of the system which 
happen to cancel when forming det G(s). For example, the system 


(s+2)/(s+1) 0 


GG)= 0 (s +1)/(s +2) 


(4.73) 
has det G(s) = 1, although the system obviously has poles at —1 and —2 and 
(multivariable) zeros at —1 and —2. 

5. G(s) in (4.73) provides a good example for illustrating the importance of 
directions when discussing poles and zeros of multivariable systems. We note 
that although the system has poles and zeros at the same locations (at —1 and 
—2), their directions are different and so they do not cancel or otherwise interact 
with each other. In (4.73) the pole at —1 has directions up = yp = [1 0]’, 
whereas the zero at —1 has directions u. = yz =[0 1]. 
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6. For square systems with a nonsingular D-matrix, the number of poles is the same 
as the number of zeros, and the zeros of G(s) are equal to the poles G~'(s), and 
vice versa. 

7. There are no zeros if the outputs contain direct information about all the states; 
that is, if from y we can directly obtain x (eg., C = I and D = 0); see 
Example 4.14. This probably explains why zeros were given very little attention 
in the optimal control theory of the 1960’s which was based on state feedback. 

8. Zeros usually appear when there are fewer inputs or outputs than states, or when 
D £0. Consider a square m x m plant G(s) = C(sI — A)~'B+ D with n states. 
We then have for the number of (finite) zeros of G(s) (Maciejowski, 1989, p.55) 


D#0: At most n—m+rank(D) zeros (4.74) 
D=0: At most n — 2m -+rank(C’B) zeros (4.75) 
D=0 and rank(CB) =m: Exactly n — m zeros (4.76) 


9. Poles may be moved by feedback control (whereas feedforward control can 
cancel poles but not move them). For example, for a strictly proper plant 
G(s) = C(sI — A)~'B the open-loop poles are determined by the characteristic 
polynomial ¢.1:(s) = det(sI — A). If we apply constant gain negative feedback u = 
—Koy, the poles are determined by the corresponding closed-loop characteristic 
polynomial ¢.;(s) = det(s!—A+BKoC). Thus, unstable plants may be stabilized 
by use of feedback control. Also see Example 4.12. 

10. In general, the zeros of G(I-+ KG)7' are the zeros of G plus the poles of K. This 
means that zeros in G are not moved by feedback. Their effect can be counteracted 
by a cascade compensator with poles at zero origins, but such calculations are 
not possible for RHP-zeros due to internal stability (see Section 4.7). However, it 
is possible to move a zero by adding a parallel branch, such as y = (G+ K)u, but 
since y is a physical output from the plant this corresponds to adding another 
actuator. 

11. The zero location, including its output direction, is unaffected by feedback. 
However, even though y, is fixed it is still possible with feedback control to 
move the deteriorating effect of a RHP-zero to a given output channel, provided 
yz has a nonzero element for this output. This was illustrated by the example in 
Section 3.8, and is discussed in more detail in Section 6.5.1. 

12. Pinned zeros. A zero is pinned to a subset of the outputs if y, has one or more 
elements equal to zero. In most cases, pinned zeros have a scalar origin. Pinned 
zeros are quite common in practice, and their effect cannot be moved freely to 
any output. For example, the effect of a measurement delay for output y; cannot 
be moved to output y2. Similarly, a zero is pinned to certain inputs if wu, has one 
or more elements equal to zero. An example is G(s) in (4.73), where the zero at 
—2 is pinned to input wi and to output y1. 

13. Zeros of nonsquare systems. The existence of zeros for non-square systems 
is common in practice in spite of what is sometimes claimed in the literature. 
In particular, they appear if we have a zero pinned to the side of the plant 
with the fewest number of channels. As an example consider a plant with three 

hit hig his 


hai(s _ z) ha2(s = z) ho3(s = z) aviach 


inputs and two outputs Gi(s) = 
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has a zero at s = z which is pinned to output ys, ie, ye = [0 1]. This 

follows because the second row of Gi(z) is equal to zero, so the rank of Gi(z) 

is 1, which is less than the normal rank of Gi(s) which is 2. On the other hand 
_ | hiu(s—z) hi2 his 

Gal ha(s—z) hoe hes 
rank 2 which is equal to the normal rank of G2(s) (assuming that the last two 
columns of G'2(s) have rank 2). 

14. There has been some discussion about whether multivariable zeros really exist 
since they may be removed by adding extra outputs. To some extent this is 
correct. It is well-known that there are no zeros if we use all the states as 
outputs. However, to control all states independently we need as many inputs. 
Thus, by adding states as outputs to remove the zeros, we get a plant which is 
not functionally controllable. 

15. The concept of functional controllability, see page 233, is related to zeros. Loosely 


does not have a zero at s = z since G'2(z) has 


speaking, one can say that a system which is functionally uncontrollable has in 
a certain output direction “a zero for all values of s”. 


The control implications of RHP-zeros and RHP-poles are discussed for SISO 
systems on pages 185-199 and for MIMO systems on pages 235-238. 


Example 4.12 Effect of feedback on poles and zeros. Consider a SISO 
negative feedback system with plant G(s) = z(s)/¢(s) and a constant gain controller, 
K(s) =k. The closed-loop response from reference r to output y is 


L(s) kG(s) kz(s) Zet(S) 
T(s)= — = =k 4.77 
()= TFL) ~T+kG) ~ He) + has) palsy = 7) 
Note the following: 
1. The zero polynomial is za(s) = 2(s), $0 the zero locations are unchanged by 
feedback. 

2. The pole locations are changed by feedback. For example, 
k>0 = — det(s) > 9(s) (4.78) 
k—-oco => gals) > kz(s) (4.79) 


That is, as we increase the feedback gain, the closed-loop poles move from open- 
loop poles to the open-loop zeros. RHP-zeros therefore imply high gain instability. 
These results are well known from a classical root locus analysis. 


Example 4.13 Consider a SISO system G(s) = C(sI — A)~'B+ D which has 
just one state, i.e. A is a scalar. The system has a zero at z = A—(CB)/D. When 
D — 0 the zero moves to infinity, i.e., no zero. 


Example 4.14 We want to prove that G(s) = C(sI — A)~'B4+ D has no zeros if 
D =0 and rank (C') = n, where n is the number of states. Solution: Consider the 
polynomial system matrit P(s) in (4.61). The first n columns of P are independent 
because C has rank n. The last m columns are independent of s. The first n and 
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last m columns are always independent since D =0 and C has full column rank and 
thus cannot have any columns equal to zero. We need to require D = 0 because if 
D is nonzero then the first n columns of P may depend on the last m columns for 
some value of s. 


Exercise 4.3 Determine the poles and zeros of 


1153 —18s?—70s—50 (s+2) 
= s(s+10)(s+1)(s—5 s+tl1)(s—5 
G(s) = ( Deane de -«C ee ) 
(s+1)(s—5) (s+1)(s—5) 
given that 
dete = eS Se Ss) 2 a eee) 


s(s + 1)?(s+ 10)(s — 5)? s(s +1)?(s + 10)(s — 5)? 


How many poles does G(s) have? 


Exercise 4.4 Given y(s) = G(s)u(s), with G(s) = i: determine a state-space 
realization of G(s) and then find the zeros of G(s) using the generalized eigenvalue 
problem. What is the transfer function from u(s) to x(s), the single state of G(s), 


and what are the zeros of this transfer function? 


Exercise 4.5 Find the zeros for a 2 x 2 plant with A 


ll 
— 
eo 
Noe 
Boe 
ea 
Noe 
Now 
us 
wy 
ll 


1 1 
ie 1. ,C=1,D=0 


Exercise 4.6 For what values of c: does the following plant have RHP-zeros? 


_f10 0 ae Or ey _ foo 
a=[ S]sqne=[2 3].0-[2 9] aso 


Exercise 4.7 Consider the plant in (4.80), but assume that both states are 
measured and used for feedback control, i.e. Ym = « (but the controlled output is 
still y = Ca + Du). Can a RHP-zero in G(s) give problems with stability in the 
feedback system? Can we achieve “perfect” control of y in this case? (Answers: No 
and no). 


4.7 Internal stability of feedback systems 


To test for closed-loop stability of a feedback system it is usually enough to 
check just one closed-loop transfer function, e.g. S(s). However, this assumes 
that there are no internal RHP pole-zero cancellations between the controller 
and the plant. The point is best illustrated by an example. 
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Example 4.15 Consider the feedback system shown in Figure 4.2 where G(s) = 


=, K(s) = Rath. In forming the loop transfer function L = GK we cancel the 


term (s —1), a RHP pole-zero cancellation, to obtain 


8 
stk 


PS” Bae rae S (4.81) 
8 

S(s) is stable, that is, the transfer function from dy to y is stable. However, the 

transfer function from dy to u is unstable: 


k(s +1) 


us -K (I+ GRY dy = - Sy 


(4.82) 
Consequently, although the system appears to be stable when considering the output 
signal y, it is unstable when considering the “internal” signal u, so the system is 
(internally) unstable. 


Figure 4.2: Internally unstable system 


Remark. In practice, it is not possible to cancel exactly a plant zero or pole because 
of modelling errors. In the above example, therefore, L and S will also be unstable. 
However, it is important to stress that even if we could achieve a perfect RHP 
pole-zero cancellation, as in the above example, we would still get an internally 
unstable system. This is a subtle but important point. In this ideal case the state- 
space descriptions of £ and S contain an unstable hidden mode corresponding to an 
unstabilizable or undetectable state. 


From the above example, it is clear that to be rigorous we must consider 
internal stability of the feedback system, see Definition 4.4. To this effect 
consider the system in Figure 4.3 where we inject and measure signals at both 
locations between the two components, G and Kk. We get 


u=(I+KG)"'d, — K(I+ GK)~"d, (4.83) 


y= GI+ KG)"'dy + (1+ GK)" dy (4.84) 


The theorem below follows immediately: 
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Figure 4.3: Block diagram used to check internal stability of feedback system 


Theorem 4.11 The feedback system in Figure 4.3 is internally stable if 
and only if all four closed-loop transfer matrices in (4.83) and (4.84) are 
stable. 


In addition, we assume as usual that the components G and K contain 
no unstable hidden modes. The signal relationship in the block diagram of 
Figure 4.3 may also be written as (see Exercise 4.8) 


-1 
u du |. _| Il #K 
[Yaaro [4]: aner= [ty *] 485 
and we get that the system is internally stable if and only if M(s) is stable. 
If we disallow RHP pole-zero cancellations between system components, 
such as G and K, then stability of one of the four closed-loop transfer functions 
implies stability of the other three. This is stated in the following theorem. 


Theorem 4.12 Assume there are no RHP pole-zero cancellations between 
G(s) and K(s), that is, all RHP-poles in G(s) and K(s) are contained in the 
minimal realizations of GK and KG. Then the feedback system in Figure 4.3 
is internally stable if and only if one of the four closed-loop transfer function 
matrices in (4.83) and (4.84) is stable. 


Proof: A detailed proof is rather involved. For more details the reader is referred to 
Zhou et al. (1996, p.125). 


Note how we define pole-zero cancellations in the above theorem. In this 
way, RHP pole-zero cancellations resulting from G or K not having full normal 
rank are also disallowed. For example, with G(s) = 1/(s — a) and K =0 we 
get GK =0so the RHP-pole at s = a has disappeared and there is effectively 
a RHP pole-zero cancellation. In this case, we get $(s) = 1 which is stable, 
but internal stability is clearly not possible. 
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Exercise 4.8 Use (A.6) to show that M(s) in (4.85) is identical to the block 
transfer function matriz implied by (4.83) at (4.84). 


4.7.1. Implications of the internal stability requirement 


The requirement of internal stability is a feedback system leads to a number 
of interesting results, some of which are investigated below. Note in particular 
Exercise 4.11 where we discuss alternative ways of implementing a two degrees- 
of-freedom controller. 

We first prove the following statements which apply when the overall 
feedback system is internally stable (Youla, Jabr and Lu, 1974): 


1. If G(s) has a RHP-zero at z, then L = GK,T = GK(I+ GK)", 
SG = (I+ GK)'G, L; = KG and T; = KG(I + KG)! will each 
have a RHP-zero at z. 

2. If G(s) has a RHP-pole at p, then L = GK and Lr = KG also have a RHP- 
pole at p, while § = (I+GK)~!, KS = K(I+GK)"! and S; = (I+ KG)" 
have a RHP-zero at p. 


Proof of 1: To achieve internal stability RHP pole-zero cancellations between system 
components, such as G and K, are not allowed. Thus L = GK must have a RHP- 
zero when G has a RHP-zero. Now S is stable and thus has no RHP-pole which can 
cancel the RHP-zero in L, and so T = LS must have a RHP-zero at z. Similarly, 
SG = (I+ GK)~'G must have a RHP-zero, ete. 


Proof of 2: Clearly, L has a RHP-pole at p. Since T is stable, it follows from T = LS 
that S must have a RHP-zero which exactly cancels the RHP-pole in J, etc. 


We notice from this that a RHP pole-zero cancellation between two transfer 
functions, such as between L and S = (I + L)~1, does not necessarily imply 
internal instability. It is only between separate physical components that RHP 
pole-zero cancellations are not allowed. 


Exercise 4.9 Interpolation constraints. For internal stability prove the 
following interpolation constraints which apply for SISO feedback systems when the 
plant G(s) has a RHP-zero z or a RHP-pole p: 


G(z) =0 L(z)=0 © T(z) =0,S(z)=1 (4.86) 


G"'(p) =0 L(p)=co  T(p) =1,S(p) =0 (4.87) 


Exercise 4.10 Given the complementary sensitivity functions 


2s+1 
s2+0.85+1 


—2Qs+4+1 


T; pa, 
(s) 2 4+085+1 


T2(s) => 


what can you say about possible RHP-poles or RHP-zeros in the corresponding loop 
transfer functions, L1(s) and L2(s)? 
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A discussion of the significance of these interpolation constraints is relevant. 
Recall that for “perfect control” we want S = 0 and T = 1. We note from 
(4.86) that a RHP-zero z puts constraints on S and T which are incompatible 
with perfect control. On the other hand, the constraints imposed by the RHP- 
pole are consistent with what we would like for perfect control. Thus the 
presence of RHP-poles mainly impose problems when tight (high gain) control 
is not possible. We discuss this in more detail in Chapters 5 and 6. 

The following example demonstrates another application of the internal 
stability requirement. 


Figure 4.4: Different forms of two degrees-of-freedom controller 
(a) General form. 
(b) Suitable when K,(s 
(c) Suitable when K,( 
(d) Suitable when K,(s) = Ki(s)Ko(s) where K1(s) contains 
no RHP-zeros and K.2(s) no RHP poles. 


) has no RHP-zeros. 
8) is stable (no RHP-poles). 
) 


Exercise 4.11 Internal stability of two degrees-of-freedom control 
configurations. A two degrees-of-freedom controller allows one to improve 
performance by treating disturbance rejection and command tracking separately (at 
least to some degree). The general form shown in Figure 4.4(a) is usually preferred 
both for implementation and design. However, in some cases one may want to first 
design the pure feedback part of the controller, here denoted Ky(s), for disturbance 
rejection, and then to add a simple precompensator, Kr(s), for command tracking. 
This approach is in general not optimal and may also yield problems when it comes 
to implementation, in particular, if the feedback controller K,(s) contains RHP poles 
or zeros, which can happen. This implementation issue is dealt with in this exercise 
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by considering the three possible schemes in Figure 4.4 (b)-4.4 (d). In all these 
schemes K, must clearly be stable. 

1) Explain why the configuration in Figure 4.4 (b) should not be used if Ky 
contains RHP-zeros (Hint: Avoid a RHP-zero between r and y). 

2) Explain why the configuration in Figure 4.4c should not be used if Ky contains 
RHP-poles. This implies that this configuration should not be used if we want integral 
action in Ky (Hint: Avoid a RHP-zero between r and y). 

3) Show that for a feedback controller Ky the configuration in Figure 4.4 (d) may 
be used, provided the RHP-poles (including integrators) of Ky are contained in ky 
and the RHP-zeros in Ky. Discuss why one may often set K, = I in this case (to 
give a fourth possibility). 


The requirement of internal stability also dictates that we must exercise 
care when we use a separate unstable disturbance model G(s). To avoid this 
problem one should for state space computations use a combined model for 
inputs and disturbances, i.e. write the model y = Gu + Gad in the form 


y=(6 Gul["] 


where G and Gy share the same states, see (4.14) and (4.17). 


4.8 Stabilizing controllers 


In this section, we introduce a parameterization, known as the Q- 
parameterization or Youla-parameterization (Youla, Jabr and Bongiorno, 
1976) of all stabilizing controllers for a plant. By all stabilizing controllers we 
mean all controllers that yield internal stability of the closed-loop system. We 
first consider stable plants, for which the parameterization is easily derived, 
and then unstable plants where we make use of the coprime factorization. 


4.8.1 Stable plants 


Theorem 4.13 For a stable plant G(s) the negative feedback system in 
Figure 4.3 is internally stable if and only if Q = K(I+GK)~? is stable. 


Proof: The four transfer functions in (4.83) and (4.84) are easily shown to be 


K(I+GK)'=Q (4.88 
(I+ GK)"'=I-—GQ (4.89 
(I+ KG)"' =I-—QG (4.90 
G(I+ KG)' =G(I- Qa) (4.91 
which are clearly all stable if G and Q are stable. Thus, with G stable the system is 
internally stable if and only if Q is stable. 
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As proposed by Zames (1981), by solving (4.88) with respect to the controller 
K, we obtain that a parameterization of all stabilizing negative feedback 
controllers for the stable plant G(s) is 


K =(I-QG)"Q = Q(I-GQ)* (4.92) 


where the “parameter” Q is any stable transfer function matriz. If only proper 
controllers are allowed then Q must be proper since the term (I — QG)~1 is 
semi-proper. 

Note: We have shown that by varying Q freely (but stably) we will always 
have internal stability, and thus avoid internal RHP pole-zero cancellations 
between K and G. This means that although Q may generate unstable 
controllers K, there is no danger of getting a RHP-pole in K that cancels 
a RHP-zero in G. 

The parameterization in (4.92) is identical to the internal model control 
(IMC) parameterization (Morari and Zafiriou, 1989) of stabilizing controllers. 
It may be derived directly from the IMC structure given in Figure 4.5. The 
idea behind the IMC-structure is that the “controller” Q can be designed in an 
open-loop fashion since the feedback signal only contains information about 
the difference between the actual output and the output predicted from the 
model. 


Figure 4.5: The internal model control (IMC) structure. 


Exercise 4.12 Show that the IMC-structure in Figure 4.5 is internally unstable if 
either Q or G is unstable. 


Exercise 4.13 Show that testing internal stability of the IMC-structure is 
equivalent to testing for stability of the four closed-loop transfer functions in (4.88- 


4.91). 
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Exercise 4.14 Given a stable controller K. What set of plants can be stabilized 
by this controller? (Hint: interchange the roles of plant and controller.) 


4.8.2 Unstable plants 


For an unstable plant G(s), consider its coprime factorization as described in 
(4.18) and (4.19) 
G(s) = N,M,1! = M,'N, (4.93) 


A parameterization of all stabilizing negative feedback controllers for the plant 
G(s) is then (Vidyasagar, 1985) 


K(s) = (Ve — QW)! (Ur + QM1) (4.94) 


where V, and U, satisfy the Bezout identity (4.19) for the right coprime 
factorization, and Q(s) is any stable transfer function satisfying the technical 
condition det (V;(co) — Q(co).Ni(co)) £0. 


Remark 1 With Q = 0 we have Ko = Vo1U;,, so V, and U, can alternatively be 
obtained from a left coprime factorization of some initial stabilizing controller Ko. 


Remark 2 For a stable plant, we may write G(s) = N,(s) = Ni(s) corresponding 
to M, = M = I. In this case U,; = 0 and V, = I satisfies the Bezout identity (4.19), 
and (4.94) yields K = (I — QG)~'@ as found before in (4.92). 


Remark 3 All closed-loop transfer functions (S, T, etc.) will be in the form 
H, + HoQHs, so they are affine? in Q. This can be useful when Q is varied to 
minimize the norm of some closed-loop transfer function. 


Remark 4 We can also formulate the parameterization of all stabilizing controllers 
in state-space form, e.g. see page 312 in Zhou et al. (1996) for details. 


4.9 Stability analysis in the frequency domain 


As noted above the stability of a linear system is equivalent to the system 
having no poles in the closed right-half plane (RHP). This test may be used 
for any system, be it open-loop or closed-loop. In this section we will study 
the use of frequency-domain techniques to derive from the open-loop transfer 
matrix L(jw), information about closed-loop stability. This provides a direct 
generalization of Nyquist’s stability test for SISO systems. 

Note that when we talk about eigenvalues in this section, we refer to the 
eigenvalues of a complex matrix, usually L(jw) = GK (jw), and not those of 
the state matrix A. 


3K function f(a) is affine in « if f(x) = a+ bz it is linear in « if a =0. 
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4.9.1 Open and closed-loop characteristic polynomials 


Figure 4.6: Negative feedback system. 


Before deriving a multivariable Nyquist condition we need some preliminary 
results involving the determinant of the return difference operator (I+ L(jw)). 
Consider the feedback system shown in Figure 4.6, where L(s) is the loop 
transfer function matrix. Stability of the open-loop system is determined by 


Aoi Ba : 
Co | Doi | that is 


the poles of L(s). If L(s) has a state-space realization | 
L(s) = Cols = Aot)~' Bot + Da (4.95) 


then the poles of L(s) are the roots of the open-loop characteristic polynomial 


do (s) = det(sI — Aoi) (4.96) 

Assume there are no RHP pole-zero cancellations between G(s) and K‘(s). 

Then from Theorem 4.12 internal stability of the closed-loop system is 

equivalent to the stability of $(s) = (I+ L(s))~1. The state matrix of $(s) is 
given (assuming L(s) is well-posed i.e. Day 4 —I) by 

Aa os Aot os Ba (I + Da) Coa (4.97) 


This equation may be derived by writing down the state-space equations for 
the transfer function from r to y in Figure 4.6 


f= Agit “ie Ba (r = y) (4.98) 


y = Cox + Do lr —y) (4.99) 


and using (4.99) to eliminate y from (4.98). The closed-loop characteristic 
polynomial is thus given by 


ger(s) 2 det(sI — Aq) = det(sI — Ag + Bo(I+ Dor) 'Cor) (4-100) 
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Relationship between characteristic polynomials 


The above identities may be used to express the determinant of the return 
difference operator, J + L, in terms of ¢-1(s) and @o:(s). From (4.95) we get 


det (I + L(s)) = det (I + Ca(st =: Aa) ' Bat + Do) (4.101) 


Schur’s formula (A.13) then yields (with Ayy = I+ Do, Aig = —Cot, Aon = 
sl — Agi, Aor = Ba) 
= det(8) nee 


doi(s) 
where c = det(I+D,,) is a constant which is of no significance when evaluating 
the poles. Note that ¢,1(s) and ¢(s) are polynomials in s which have zeros 
only, whereas det(I + L(s)) is a transfer function with both poles and zeros. 
From (4.102), assuming no cancellations between d.i(s) and ¢1(s), we have 
that the closed-loop poles are solutions to 


det (I + L(s)) (4.102) 


det (I + L(s)) =0 (4.103) 


Example 4.16 We will rederive expression (4.102) for SISO systems. Let L(s) = 
ko The sensitivity function is given by 


a Pot(s) 
80) =F), Ge eae) G07) 


and the denominator is 
kz(s) 
doi(8) 


which is the same as ¢i(s) in (4.102) (except for the constant c which is necessary 
to make the leading coefficient of da(s) equal to 1, as required by its definition). 


d(s) = k2(s) + boils) = boi(s)(1 


) = Goi(s)(1 + L(s)) (4.105) 


Remark. One may be surprised to see from (4.104) that the zero polynomial of 
S(s) is equal to the open-loop pole polynomial, ¢.:(s), but this is indeed correct. On 
the other hand, note from (4.77) that the zero polynomial of T(s) = L(s)/(1+ L(s)) 
is equal to z(s), the open-loop zero polynomial. 


Expression (4.102) for det(I + L(s)) enables a straightforward generalization 
of Nyquist’s stability condition to multivariable systems. This in turn is very 
useful in developing robustness tests and so will be considered next in some 
detail. 


4.9.2 MIMO Nyquist stability criteria 


We will consider the negative feedback system of Figure 4.6, and assume 
there are no internal RHP pole-zero cancellations in the loop transfer function 


158 MULTIVARIABLE FEEDBACK CONTROL 
ATm 
0 
> 
Re 


Figure 4.7: Nyquist D-contour for system with no open-loop jw-axis poles. 


matrix L(s), ie. L(s) contains no unstable hidden modes. Then the generalized 
(or MIMO) Nyquist theorem, which allows us to evaluate closed-loop stability 
from the frequency response of L(jw), can be stated as follows. 


Theorem 4.14 Generalized Nyquist theorem. 

Let P, denote the number of open-loop unstable poles in L(s). The closed- 
loop system with loop transfer function L(s) and negative feedback is stable if 
and only if the Nyquist plot of det(I + L(s)) 

i) makes Pq, anti-clockwise encirclements of the origin, and 

ii) does not pass through the origin. 


Remark 1 By “Nyquist plot of det(/+L(s))” we mean “the image of det(I+L(s)) 
as s goes clockwise around the Nyquist D-contour”. The Nyquist D-contour includes 
the entire jw-axis (s = jw) and an infinite semi-circle into the right-half plane as 
illustrated in Figure 4.7. The D-contour must also avoid locations where L(s) has 
jw-axis poles by making small indentations (semi-circles) around these points. 


Remark 2 In the following we define for practical reasons unstable poles or RHP- 
poles as poles in the open RHP, excluding the jw-axis. In this case the Nyquist 
D-contour should make a small semicircular indentation into the RHP at locations 
where L(s) has jw-axis poles, thereby avoiding the extra count of encirclements due 
to jw-axis poles. 


Remark 3 Another practical way of avoiding the indentation is to shift all jw-axis 
poles into the LHP, for example, by replacing the integrator 1/s by 1/(s +6) where 
€ is a small positive number. 


Remark 4 We see that for stability det(J + L(jw)) should make no encirclements 
of the origin if L(s) is open-loop stable, and should make P,; anti-clockwise 
encirclements if L(s) is unstable. If this condition is not satisfied then the number of 
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Figure 4.8: Typical Nyquist plot of 1+ det L(jw). 


closed-loop unstable poles of (J + L(s))~! is Pa = N + Py where N is the number 
of clockwise encirclements of the origin by the Nyquist plot of det(I + L(jw)), see 
(4.108). 


Remark 5 For any real system, [(s) is proper and so to plot det(J + L(s)) as s 
traverses the D-contour we need only consider s = jw along the imaginary axis. 
This follows since lims—oo L(s) = Dou is finite, and therefore for s = co the Nyquist 
plot of det(J + L(s)) converges to det(J + D 1) which is on the real axis. 


Remark 6 In many cases L(s) contains integrators so for w = 0 the plot of 
det(J + L(jw)) may “start” from +joo. A typical plot for positive frequencies is 
shown in Figure 4.8 for the system 

3(—2s + 1) 12.78 +1 


K=1.14 41 
(5s + 1)(10s + 1)’ 12.78 C00) 


L=GK, G= 


Note that the solid and dashed curves (positive and negative frequencies) need to 
be connected as w approaches 0, so there is also a large (infinite) semi-circle (not 
shown) corresponding to the indentation of the D-contour into the RHP at s = 0 
(the indentation is to avoid the integrator in L(s)). To find which way the large 
semi-circle goes one can use the rule (based on conformal mapping arguments) that 
a right-angled turn in the D-contour will result in a right-angled turn in the Nyquist 
plot. It then follows for the example in (4.106) that there will be an infinite semi- 
circle into the RHP. There are therefore no encirclements of the origin. Since there 
are no open-loop unstable poles (jw-axis poles are excluded in the counting), Po: = 0, 
and we conclude that the closed-loop system is stable. 


Proof of Theorem 4.14: The proof makes use of the following result from complex 
variable theory (Churchill, Brown and Verhey, 1974): 
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Lemma 4.15 Argument Principle. Consider a (transfer) function f(s) and let 
C denote a closed contour in the compler plane. Assume that 


1. f(s) is analytic along C, that is, f(s) has no poles on C. 
2. f(s) has Z zeros inside C. 
3. f(s) has P poles inside C. 


Then the image f(s) as the complex argument s traverses the contour C once in a 
clockwise direction will make Z — P clockwise encirclements of the origin. 


Let N(A, f(s), C) denote the number of clockwise encirclements of the point A 
by the image f(s) as s traverses the contour C' clockwise. Then a restatement of 
Lemma 4.15 is 

N(0, f(s),C) =Z—P (4.107) 
We now recall (4.102) and apply Lemma 4.15 to the function f(s) = det(I+ L(s)) = 
us) -c selecting C to be the Nyquist D-contour. We assume c = det(J + D1) #0 
since otherwise the feedback system would be ill-posed. The contour D goes along 
the jw-axis and around the entire RHP, but avoids open-loop poles of L(s) on the 
jw-axis (where ¢oi(jw) = 0) by making small semi-circles into the RHP. This is 
needed to make f(s) analytic along D. We then have that f(s) has P = P,; poles 
and Z = P. zeros inside D. Here P., denotes the number of unstable closed-loop 
poles (in the open RHP). (4.107) then gives 


N (0, det(I + L(s)),D) = Pa — Pot (4.108) 


Since the system is stable if and only if P.; = 0, condition i) of Theorem 4.14 follows. 
However, we have not yet considered the possibility that f(s) = det(/ + L(s)), and 
hence ¢.(s), has zeros on the D-contour itself, which will also correspond to a 
closed-loop unstable pole. To avoid this det(J + L(jw)) must not be zero for any 
value of w and condition ii) in Theorem 4.14 follows. 


Example 4.17 SISO stability conditions. Consider an open-loop stable SISO 
system. In this case, the Nyquist stability condition states that for closed-loop stability 
the Nyquist plot of 1+ L(s) should not encircle the origin. This is equivalent to the 
Nyquist plot of L(jw) not encircling the point —1 in the complex plane 


4.9.3. Eigenvalue loci 


The eigenvalue loci (sometimes called characteristic loci) are defined as the 
eigenvalues of the frequency response of the open-loop transfer function, 
\;i(L(jw)). They partly provide a generalization of the Nyquist plot of L(jw) 
from SISO to MIMO systems, and with them gain and phase margins can be 
defined as in the classical sense. However, these margins are not too useful 
as they only indicate stability with respect to simultaneous changes in all the 
loops. Therefore, although characteristic loci were well researched in the 70’s 
and greatly influenced the British developments in multivariable control, e.g. 
see Postlethwaite and MacFarlane (1979), they will not be considered further 
in this book. 
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4.9.4 Small gain theorem 


The small gain theorem is a very general result which we will find useful 
in the book. We present first a generalized version of it in terms of the 
spectral radius, p(L(jw)), which at each frequency is defined as the maximum 
eigenvalue magnitude 


p(L(jw)) = max |i (L(jw))| (4.109) 


Theorem 4.16 Spectral radius stability condition. Consider a system 
with a stable loop transfer function L(s). Then the closed-loop system is stable 
if 

p(L(jw)) <1 Vw (4.110) 


The result is quite intuitive, as it simply says that if the system gain is less 
than 1 in all directions (all eigenvalues) and for all frequencies (Vw), then all 
signal deviations will eventually die out, and the system is stable. 

In general, the spectral radius theorem is conservative because phase 
information is not considered. For SISO systems p(L(jw)) = |L(jw)|, so the 
above stability condition requires that |L(jw)| < 1 at all frequencies. This is 
clearly conservative, since from the Nyquist stability condition for a stable 
L(s), we need only require |L(jw)| < 1 at frequencies where the phase of 
L(jw) is —180° + n 360°. As an example, let L = k/(s +). Since the phase 
never reaches -180° the system is closed loop stable for any value of k > 0. 
However, to satisfy (4.110) we need k < ¢, which for a small value of € is very 
conservative indeed. 

Later we will consider cases where the phase of L is allowed to vary freely, 
and in which case Theorem 4.16 is not conservative. Actually, a clever use of 
the above theorem is the main idea behind most of the conditions for robust 
stability and robust performance presented later in this book. 


Proof of Theorem 4.16: The generalized Nyquist theorem (Theorem 4.14) says that 
if L(s) is stable, then the closed-loop system is stable if and only if the Nyquist 
plot of det(I + L(s)) does not encircle the origin. To prove condition (4.110) we will 
prove the ‘reverse’, that is, if the system is unstable and therefore det(J + L(s)) does 
encircle the origin then there is an eigenvalue, \;(L(jw)), which is larger than 1 at 
some frequency. If det(J + L(s)) does encircle the origin, then there must exists a 
gain € € (0, 1] and a frequency w’ such that 


det(I + eL(jw')) =0 (4.111) 


This is easily seen by geometric arguments since det(I + ¢L(jw')) = 1 for « = 0. 
(4.111) is equivalent to (see eigenvalue properties in Appendix A.2.1) 


@ [Pr t+eL(ju')) =0 (4.112) 


a 
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° 1+6A,(L(jw')) =0 for some i (4.113 
° Ai(L(jw')) = ee for some 7 (4.114 
€ 
> |\i(L(jw'))| > 1 for some i (4.115 
& — p(L(ju")) 21 (4.116 


The Small Gain Theorem below follows directly from Theorem 4.16 if we 
consider a matrix norm satisfying ||AB|| < ||Al| - || B||, since at any frequency 
we then have p(L) < ||L|| (see (A.114)). 


Theorem 4.17 Small Gain Theorem. Consider a system with a stable 
loop transfer function L(s). Then the closed-loop system is stable if 


|ZUjw)|| <1 Vw (4.117) 
where ||L|| denotes any matrix norm satisfying ||AB|| < ||All - ||Bll. 


Remark 1 This result is only a special case of a more general small gain theorem 
which also applies to some nonlinear systems (Vidyasagar, 1978). 


Remark 2 The small gain theorem does not consider phase information, and is 
therefore independent of the sign of the feedback. 


Remark 3 Any induced norm can be used, for example, the singular value, a(L). 


Remark 4 The small gain theorem can be extended to include more than one block 
in the loop, e.g., L = L; Lo. In this case we get from (A.95) that the system is stable 
if ||Z1 |] - [Lal] < 1, Vw. 


Remark 5 The small gain theorem is generally more conservative than the 
spectral radius condition in Theorem 4.16. Therefore, the arguments made following 
Theorem 4.16 on conservativeness also apply to Theorem 4.17. 


4.10 System norms 


Consider the system in Figure 4.9, with stable transfer function matrix G(s) 
and impulse response matrix g(t). To evaluate the performance we ask the 
question: given information about the allowed input signals w, how large 
can the outputs z become? To answer this question we must evaluate the 
system norm relevant to the input class and how we measure the outputs. 
We here present three commonly used system norms, namely the H2, Hoo 
and Hankel. We introduced the Hz and H,.. norms in Section 2.7, where we 
also discussed the unusual terminology. In Appendix A.5.7 we present a more 
detailed interpretation and comparison of these and other norms. 
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Figure 4.9: System G. 


4.10.1 Hs. norm 


Consider a strictly proper system G(s), i.e. D = 0 in a state-space realization. 
For the Hz norm we use the Frobenius norm spatially (for the matrix) and 
integrate over frequency, i.e. 


IG@a2 | f we@Gay"GGe)) ae (4.18) 


20 
GG) IR=D0, Giz (Fw) |? 


aj 
We see that G(s) must be strictly proper, otherwise the H2 norm is infinite. 
The Hz norm can be given another interpretation. Let G(s) = C(sf —A)~1B. 
By Parseval’s theorem (4.118) is equal to the Hz norm of the impulse response 


|G(s)ll = llg()ll [ tr(g" (r)g(r)) dr (4.119) 
laOlp=o,, l9i3(7)/? 


Remark. Note that G(s) and g(t) are dynamic systems while G(jw) and g(r) are 
constant matrices (for a given value of w or 7). 


We can change the order of integration and summation in (4.119) to get 


G(s) = lol = | i, ” gs Par (4.120) 


From this we see that the H2 norm can be interpreted as the two-norm output 
resulting from applying unit impulses 6;(t) to each input, one after another 
(allowing the output to settle to zero before applying an impulse to the next 
input). This is seen more clearly seen by writing ||G(s)||3 = />>;_, |lzi()(|/3 
where 2;(t) is the output vector resulting from applying an unit impulse 6; (t) 
to the 7’th input. In summary, we have the following deterministic performance 
interpretation of the Hz norm: 


|G(s)ll2 = max Ilz(4)|l2 (4.121) 


w(t) unit impulses 
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The Hz norm can also be given a stochastic interpretation (see page 388) in 
terms of the quadratic criterion in optimal control (LQG) where we measure 
the expected root mean square (rms) value of the output in response to white 
noise excitation. 

By substituting (4.10) into (4.119) we find 


|G(s)llo = /tr(BTQB) or |G(s)|lb = y/tr(CPCT) (4.122) 


where Q and P are the observability and controllability Gramians, 
respectively, obtained as solutions to the Lyapunov equations (4.49) and 
(4.44). (4.122) is used for numerical computations of the H2 norm. 


4.10.2 H. norm 


Consider a proper linear stable system G(s) (i.e. D 4 0 is allowed). For the 
H.» norm we use the singular value (induced 2-norm) spatially (for the matrix) 
and pick out the peak value as a function of frequency 


|G(s) loo = max o(G(jw)) (4.123) 


In terms of performance we see from (4.123) that the H. norm is the peak 
of the transfer function magnitude, and by introducing weights the H.. norm 
can be interpreted as the magnitude of some closed-loop transfer function 
relative to a specified upper bound. This leads to specifying performance in 
terms of weighted sensitivity, mixed sensitivity, and so on. 

However, the #,. norm also has several time domain performance 
interpretations. First, as discussed in Section 3.3.5, it is the worst case steady- 
state gain to sinusoids at any frequency. Furthermore, from Tables A.1 and 
A.2 in the Appendix we see that the H.. norm is equal to the induced (“worst- 
case”) 2-norm in the time domain: 


|G(s)]loo = max (22 nase jle(t)|h (4.124) 


ax => x 
w(t)40 ||w(t)|l2 — |feo(t)jo=t 


This is a fortunate fact from functional analysis which is proved, for example, 
in Desoer and Vidyasagar (1975). In essence, (4.124) forms because the worst 
input signal w(t) is a sinusoid with frequency w* and a direction which gives 
a(G(jw*)) as the maximum gain. (4.124) also shows that the H. norm is 
an induced norm, and thus possesses all the nice properties of such norms, 
e.g. the multiplicative property in (4.134). The Ho norm is also equal to 
the induced power (rms) norm, and also has an interpretation as an induced 
norm in terms of the expected values of stochastic signals. All these various 
interpretations make the H.. norm useful in engineering applications. 


ELEMENTS OF LINEAR SYSTEM THEORY 165 


The H. norm is usually computed numerically from a state-space 
realization as the smallest value of y such that the Hamiltonian matrix H 
has no eigenvalues on the imaginary axis, where 


A+BR ‘DTC BR-'BT 


H=|_ot4DR—ODT)C —(A+BR-DTO)T 


(4.125) 


and R = y?I — DTD, see Zhou et al. (1996, p.115). This is an iterative 
procedure where one may start with a large value of y and reduce it until 
imaginary eigenvalues for H appear. 


4.10.3 Difference between the H. and H,. norms 


To understand the difference between the H» and H., norms note that from 
(A.124) we can write the Frobenius norm in terms of the singular values. We 


then have 
|G(s)|l2 = Var / di o7 (Gj) deo (4.126) 


From this we see that minimizing the H..-norm corresponds to minimizing 
the peak of the largest singular value (“worst direction, worst frequency”), 
whereas minimizing the H2-norm corresponds to minimizing the sum of the 
square of all the singular values over all frequencies (“average direction, 
average frequency” ). In summary, we have 


¢ Hoo: “push down peak of largest singular value”. 
e H»2: “push down whole thing” (all singular values over all frequencies). 


Example 4.18 We will compute the Hoo and Hz norms for the following SISO 
plant 


G(s) = (4.127) 


The H2 norm is 


1 i , 2 i 1 =f plxane 1 1 

G =(<- G dw)? = t Baa on 4.128 
G(s) Gf (ju)|? do)* = (=—ftan (2% .)# = yf 5 (4.128) 

tat 

To check Parseval’s theorem we consider the impulse response 
1 
t=ct (—) =e“ 4t> 4.12 
g(t) =£°' (T=) =e "420 (4.129) 


and we get 


li(ll2 = if (e-wpat = ft (4.130) 
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as expected. The Ho norm is 
: 1 1 
|G(s)|]oo = sup |G(jw)| = sup T= (4.131) 
w w (w?+a?)? a 


For interest, we also compute the 1-norm of the impulse response (which is equal to 
the induced oo-norm in the time domain): 


ial = [| ge jae= + (4.132) 


In general, it can be shown that ||G(s)|lo < ||g(t)||1, and this example illustrates 
that we may have equality. 


Example 4.19 There exists no general relationships between the H2 and Heo 
norms. As an example consider the two systems 


fils) = a fo(s) = ae 


es+1’ 
and let e — 0. Then we have for fi that the Hoonorm is 1 and the H2 norm 
approaches infinity. For fo the Honorm is again 1, but now the H2 norm approaches 
zero. 


(4.133) 


Why is the #., norm so popular? In robust control we use the Ho, 
norm mainly because it is convenient for representing unstructured model 
uncertainty, and because it satisfies the multiplicative property (A.95): 


|A(s)B(s) loo S |IA(S)Ileo * B(S)Ilo0 (4.134) 


What is wrong with the 2 norm? The 72 norm has a number of good 
mathematical and numerical properties, and its minimization has important 
engineering implications. However, the Hz norm is not an induced norm and 
does not satisfy the multiplicative property. 


Example 4.20 Consider again G(s) = 1/(s + a) in (4.127), for which we found 
||G(s)|l2 = 1/1/2a. Now consider the H2 norm of G(s)G(s): 


Ieweoye= | fier = 1B = ii e(s)18 
0 n~_-_-—_—_- 


te—at 


and we find, fora <1, that 
IG(s)G(s)llo > I]G(s)lle - IG(s)lle (4.135) 


which does not satisfy the multiplicative property (A.95). On the other hand, the Hao 
norm does satisfy the multiplicative property, and for the specific example we have 


equality with ||G(s)G(s)|loo = gr = |IG(s)lleo « IIG(s)lleo- 


Figure 4.10: Pumping a swing: Illustration of Hankel operator. The input is applied 
for t < 0 and the jump starts att=0. 


It may be shown that the Hankel norm is equal to 
|G(s)|lz = Ve(PQ) (4.137) 


where p is the spectral radius (maximum eigenvalue), P is the controllability 
Gramian defined in (4.43) and Q the observability Gramian defined in (4.48). 
The name “Hankel” is used because the matrix PQ has the special structure 
of a Hankel matrix (the transpose of which has identical elements along the 
“wrong-way” diagonals). The corresponding Hankel singular values are the 
positive square roots of the eigenvalues of PQ, 


o; = /M(PQ) (4.138) 
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The Hankel and H...norms are closely related and we have (Zhou et al., 1996, 
p.111) 


G(s) ll = 1 < |IG(s)Iloo < 200 (4.139) 


Thus, the Hankel norm is always smaller than (or equal to) the H,.norm, 
which is also reasonable by comparing the definitions in (4.124) and (4.136). 

Model reduction. Consider the following problem: given a state-space 
description G(s) of a system, find a model G,(s) with fewer states such that 
the input-output behaviour (from w to z) is changed as little as possible. Based 
on the discussion above it seems reasonable to make use of the Hankel norm, 
since the inputs only affect the to outputs through the states at ¢ = 0. For 
model reduction, we usually start with a realization of G which is internally 
balanced, that is, such that, Q = P = © where © is the matrix of Hankel 
singular values. We may then discard states (or rather combinations of states 
corresponding to certain subspaces) corresponding to the smallest Hankel 
singular values. The change in Ho norm caused by deleting states in G(s) is 
less than twice the sum of the discarded Hankel singular values, i.e., 


G(s) — Ga(s)lloo < 2(on41 + on+2 + +**) (4.140) 


where G,(s) denotes a truncated or residualized balanced realization with 
k states; see Chapter 11. The method of Hankel norm minimization gives 
a somewhat improved error bound, where we are guaranteed that |/G(s) — 
Ga(s)|loo is less than the sum of the discarded Hankel singular values. This 
and other methods for model reduction are discussed in detail in Chapter 11 
where a number of examples can be found. 


Exercise 4.15 Let a = 0.5 and « = 0.0001 and check numerically the results in 
Examples 4.18, 4.19 and 4.20 using, for ecample, the MATLAB y-toolborz commands 
h2norm and hinfnorm. Also compute the Hankel norm, e.g. using the commands 
[sysb,hsig]=sysbal(sys); max(hsig). 


5 


LIMITATIONS ON 
PERFORMANCE IN SISO 
SYSTEMS 


In this chapter, we discuss the fundamental limitations on performance in SISO 
systems. We summarize these limitations in the form of a procedure for input-output 
controllability analysis, which is then applied to a series of examples. Input-output 
controllability of a plant is the ability to achieve acceptable control performance. 
Proper scaling of the input, output and disturbance variables prior to this analysis 
is critical. 


5.1 Input-Output Controllability 


In university courses on control, the methods for controller design and stability 
analysis are usually emphasized. However, in practice the following three 
questions are often more important: 

I. How well can the plant be controlled? Before starting any controller 
design one should first determine how easy the plant actually is to control. Is 
it a difficult control problem? Indeed, does there even exist a controller which 
meets the required performance objectives? 

II. What control structure should be used? By this we mean what 
variables should we measure, which variables should we manipulate, and 
how are these variables best paired together? In other textbooks one can 
find qualitative rules for these problems. For example, in Seborg, Edgar 
and Mellichamp (1989) in a chapter called “The art of process control”, the 
following rules are given: 


1. Control the outputs that are not self-regulating. 

2. Control the outputs that have favourable dynamic and static characteris- 
tics, i.e. for each output, there should exist an input which has a significant, 
direct and rapid effect on it. 
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3. Select inputs that have large effects on the outputs. 
4. Select inputs that rapidly affect the controlled variables 


These rules are reasonable, but what is “self-regulating”, “large”, “rapid” and 


“direct”. A major objective of this chapter is to quantify these terms. 

III. How might the process be changed to improve control? For 
example, to reduce the effects a disturbance one may in process control 
consider changing the size of a buffer tank, or in automotive control one might 
decide to change the properties of a spring. In other situations, the speed of 
response of a measurement device might be an important factor in achieving 
acceptable control. 

The above three questions are each related to the inherent control 
characteristics of the process itself. We will introduce the term input-output 
controllability to capture these characteristics as described in the following 
definition. 


Definition 5.1 (Input-output) controllability is the ability to achieve 
acceptable control performance; that is, to keep the outputs (y) within specified 
bounds or displacements from their references (r), in spite of unknown but 
bounded variations, such as disturbances (d) and plant changes, using available 
inputs (u) and available measurements (ym or dm). 


In summary, a plant is controllable if there exists a controller (connecting 
plant measurements and plant inputs) that yields acceptable performance 
for all expected plant variations. Thus, controllability is independent of the 
controller, and is a property of the plant (or process) alone. It can only be 
affected by changing the plant itself, that is, by (plant) design changes. These 
may include: 


changing the apparatus itself, e.g. type, size, etc. 

relocating sensors and actuators 

adding new equipment to dampen disturbances 

adding extra sensors 

adding extra actuators 

changing the control objectives 

changing the configuration of the lower layers of control already in place 


Whether or not the last two terms are as design modifications is arguable, but 
at least they address important issues which are relevant before the controller 
is designed. 

Early work on input-output controllability analysis includes that of Ziegler 
and Nichols (1943), Rosenbrock (1970) and Morari (1983) who made use of 
the concept of “perfect control”. Important ideas on performance limitations 
are also found in Bode (1945), Horowitz (1963), Frank (1968a, 1968b) , 
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Horowitz and Shaked (1975), Zames (1981), Doyle and Stein (1981), Francis 
and Zames (1984), Boyd and Desoer (1985), Kwakernaak (1985), Freudenberg 
and Looze (1985, 1988), Engell (1988), Morari and Zafiriou (1989), Boyd 
and Barratt (1991), and Chen (1995). We also refer the reader to two 
IFAC workshops on Interactions between process design and process control 
(Perkins, 1992; Zafiriou, 1994). 


5.1.1 Input-output controllability analysis 


Input-output controllability analysis is applied to a plant to find out 
what control performance can be expected. Another term for input-output 
controllability analysis is performance targeting. 

Surprisingly, given the plethora of mathematical methods available for 
control system design, the methods available for controllability analysis are 
largely qualitative. In most cases the “simulation approach” is used i.e. 
performance is assessed by exhaustive simulations However, this requires a 
specific controller design and specific values of disturbances and setpoint 
changes. Consequently, with this approach, one can never know if the result is 
a fundamental property of the plant, or if it depends on the specific controller 
designed, the disturbances or the set points. 

A rigorous approach to controllability analysis would be to formulate 
mathematically the control objectives, the class of disturbances, the model 
uncertainty, etc., and then to synthesize controllers to see whether the 
objectives can be met. With model uncertainty this involves designing a pu- 
optimal controller (see Chapter 8). However, in practice such an approach 
is difficult and time consuming, especially if there are a large number of 
candidate measurements or actuators, see Chapter 10. More desirable, is to 
have a few simple tools which can be used to get a rough idea of how easy 
the plant is to control, i.e. to determine whether or not a plant is controllable, 
without performing a detailed controller design. The main objective of this 
chapter is to derive such controllability tools based on appropriately scaled 
models of G(s) and Ga(s). 

An apparent shortcoming of the controllability analysis presented in this 
book is that all the measures are linear. This may seem restrictive, but 
usually it is not. In fact, one of the most important nonlinearities, namely 
that associated with input constraints, can be handled quite well with a 
linear analysis. Also, to deal with slowly varying changes one may perform 
a controllability analysis at several selected operating points. Nonlinear 
simulations to validate the linear controllability analysis are of course still 
recommended. Experience from a large number of case studies confirms that 
the linear measures are often very good. 
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5.1.2 Scaling and performance 


The above definition of controllability does not specify the allowed bounds for 
the displacements or the expected variations in the disturbance; that is, no 
definition of the desired performance is included. Throughout this chapter and 
the next, when we discuss controllability, we will assume that the variables and 
models have been scaled as outlined in Section 1.4, so that the requirement 
for acceptable performance is: 


e To keep the output y(t) within the range r—1 to r+1 (at least most of 
the time) for any disturbance d(t) between —1 and 1 or any reference r(t) 
between —R and R, using an input u(t) within the range —1 to 1. 


We will interpret this definition from a frequency-by-frequency sinusoidal 
point of view, i.e. d(t) = sinwt, and so on. We then have: 


e At each frequency the performance requirement is to keep the control error 
|e(w)| < 1, for any disturbance |d(w)| < 1 or any reference |r(w)| < R(w), 
using an input |u(w)| <1. 


It is impossible to track very fast reference changes, so we will assume 
that R(w) is frequency-dependent; for simplicity we assume that R(w) is R (a 
constant) up to the frequency w, and is 0 above that frequency. 

It could also be argued that the magnitude of the sinusoidal disturbances 
should approach zero at high frequencies. While this may be true, we really 
only care about frequencies within the bandwidth of the system, and in 
most cases it is reasonable to assume that the plant experiences sinusoidal 
disturbances of constant magnitude up to this frequency. Similarly, it might 
also be argued that the allowed control error should be frequency dependent. 
For example, we may require no steady-state offset, i.e. e should be zero at 
low frequencies. However, including frequency variations is not recommended 
when doing a preliminary analysis (however, one may take such considerations 
into account when interpreting the results). 

Recall that with r = Rr (see Section 1.4) the control error may be written 
as 

e=y-—r=Gu+Gad— Rr (5.1) 


where R is the magnitude of the reference and |7(w)| < 1 and |d(w)| < 1 are 
unknown signals. We will use (5.1) to unify our treatment of disturbances and 
references. Specifically, we will derive results for disturbances, which can then 
be applied directly to the references by replacing Gag by —R. 


5.1.3. Remarks on the term controllability 


The above definition of (input-output) controllability is in tune with most engineers’ 
intuitive feeling about what the term means, and was also how the term was used 


LIMITATIONS ON PERFORMANCE IN SISO SYSTEMS 173 


historically in the control literature. For example, Ziegler and Nichols (1943) defined 
controllability as “the ability of the process to achieve and maintain the desired 
equilibrium value”. Unfortunately, in the 60’s “controllability” became synonymous 
with the rather narrow concept of “state controllability” introduced by Kalman, and 
the term is still used in this restrictive manner by the system theory community. 
State controllability is the ability to bring a system from a given initial state to any 
final state within a finite time. However, as shown in Example 4.5 this gives no 
regard to the quality of the response between and after these two states and the 
required inputs may be excessive. The concept of state controllability is important 
for realizations and numerical calculations, but as long as we know that all the 
unstable modes are both controllable and observable, it usually has little practical 
significance. For example, Rosenbrock (1970, p. 177) notes that “most industrial 
plants are controlled quite satisfactorily though they are not [state] controllable”. 
And conversely, there are many systems, like the tanks in series Example 4.5, 
which are state controllable, but which are not input-output controllable. To avoid 
any confusion between practical controllability and Kalman’s state controllability, 
Morari (1983) introduced the term dynamic resilience. However, this term does not 
capture the fact that it is related to control, so instead we prefer the term input- 
output controllability, or simply controllability when it is clear we are not referring 
to state controllability. 

Where are we heading? In this chapter we will discuss a number 
of results related to achievable performance. Many of the results can be 
formulated as upper and lower bounds on the bandwidth of the system. 
However, as noted in Section 2.4.5, there are several definitions of bandwidth 
(wp, We and wr) in terms of the transfer functions S, L and T, but since we 
are looking for approximate bounds we will not be too concerned with these 
differences. The main results are summarized at end of the chapter in terms 
8 controllability rules. 


5.2 Perfect control and plant inversion 


A good way of obtaining insight into the inherent limitations on performance 
originating in the plant itself, is to consider the inputs needed to achieve 
perfect control. Let the plant model be 


y =Gu+Gad (5.2) 


“Perfect control” (which, of course, cannot be realized in practice) is achieved 
when the output is identically equal to the reference, i.e. y = r. To find the 
corresponding plant input set y =r and solve for u in (5.2): 


u=G 'r—-G'Gad (5.3) 
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(5.3) represents a perfect feedforward controller, assuming d is measurable. 
When feedback control u = A (r — y) is used we have from (2.20) that 


u= KSr— KSGad 
or since the complementary sensitivity function is T= GKS, 
u=G'Tr-G'TGad (5.4) 


We see that at frequencies where feedback is effective and T = I (these 
arguments also apply to MIMO systems), the input generated by feedback 
in (5.4) is the same as the perfect control input in (5.3). That is, high gain 
feedback generates an inverse of G even though the controller K may be very 
simple. 

An important lesson therefore is that perfect control requires the controller 
to somehow generate an inverse of G. From this we get that perfect control 
cannot be achieved if 


e G contains RHP-zeros (since then G~! is unstable) 
e G contains time delay (since then G~! contains a prediction) 
¢ G has more poles than zeros (since then G~! is unrealizable) 


In addition, for feedforward control we have that perfect control cannot be 
achieved if 


¢ Gis uncertain (since then G~! cannot be obtained exactly) 


The last restriction may be overcome by high gain feedback , but we know 
that we cannot have high gain feedback at all frequencies. 

The required input in (5.3) must not exceed the maximum physically 
allowed value. Therefore, perfect control cannot be achieved if 


e |G Gal is large 
e |G"'R| is large 


where “large” with our scaled models means larger than 1. There are also 
other situations which make control difficult such as: 


e G is unstable 
e |Ga| is large 


Tf the plant is unstable then the outputs will “take off”, and eventually hit 
physical constraints, unless feedback control is applied to stabilize the system. 
Similarly, if |Ga| is large, then without control a disturbance will cause the 
outputs to move far away from their desired values. So in both cases control is 
required, and problems occur if this demand for control is somehow in conflict 
with the other factors mentioned above which also make control difficult. We 
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have assumed perfect measurements in the discussion so far, but in practice, 
noise and uncertainty associated with the measurements of disturbances and 
outputs will present additional problems for feedforward and feedback control, 
respectively. 


5.3. Constraints on S and T 


In this section, we present some fundamental algebraic and analytic 
constraints which apply to the sensitivity S and complementary sensitivity 
Ti 


5.3.1 S plus T is one 

From the definitions S = (I+ L)~1 and T = L(I + L)~1 we derive 
S+T=1 (5.5) 

(or S+T = TI for a MIMO system). Ideally, we want S small to obtain the 

benefits of feedback (small control error for commands and disturbances), 

and T small to avoid sensitivity to noise which is one of the disadvantages of 

feedback. Unfortunately, these requirements are not simultaneously possible 


at any frequency as is clear from (5.5). Specifically, (5.5) implies that at any 
frequency either |S(jw)| or |Z(jw)| must be larger than or equal to 0.5. 


5.3.2 The waterbed effects (sensitivity integrals) 


Magnitude 


Frequency [rad/s] 


Figure 5.1: Plot of typical sensitivity, |S], with upper bound 1/|wp| 


A typical sensitivity function is shown by the solid line in Figure 5.1. We 
note that |.S| has a peak value greater than 1; we will show that this peak 
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is unavoidable in practice. Two formulae are given, in the form of theorems, 
which essentially say that if we push the sensitivity down at some frequencies 
then it will have to increase at others. The effect is similar to sitting on a 
waterbed: pushing it down at one point, which reduces the water level locally 
will result in an increased level somewhere else on the bed In general, a trade- 
off between sensitivity reduction and sensitivity increase must be performed 
whenever: 


1. L(s) has at least two more poles than zeros (first waterbed formula), or 
2. L(s) has a RHP-zero (second waterbed formula). 


Pole excess of two: First waterbed formula 


To motivate the first waterbed formula consider the open-loop transfer 
function L(s) = ACES As shown in Figure 5.2, there exists a frequency 
range over which the Nyquist plot of L(jw) is inside the unit circle centred 
on the point —1, such that |1+ L], which is the distance between L and —1, 
is less than one, and thus |S| = |1 + L|~1 is greater than one. In practice, 
L(s) will have at least two more poles than zeros (at least at sufficiently high 
frequency), so there will always exist a frequency range over which |S| is 
greater than one. This behaviour may be quantified by the following theorem, 
of which the stable case is a classical result due to Bode. 


L(jw) 


Figure 5.2: |S| > 1 whenever the Nyquist plot of L is inside the circle 


Theorem 5.2 Bode Sensitivity Integral. Suppose that the open-loop 
transfer function L(s) is rational and has at least two more poles than zeros 
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(relative degree of two or more). Suppose also that L(s) has Np, RHP-poles at 
locations p;. Then for closed-loop stability the sensitivity function must satisfy 


foe) Np 
[ In|S(jw)|dw =7- » Re(p;) (5.6) 


w=1 
where Re(p;) denotes the real part of pj. 


Proof: See Doyle et al. (1992, p. 100) or Zhou et al. (1996). The generalization of 
Bode’s criterion to unstable plants is due to Freudenberg and Looze (1985, 1988). 


For a graphical interpretation of (5.6) note that the magnitude scale is 
logarithmic whereas the frequency-scale is linear. 
Stable plant. For a stable plant we must have 


foe) 
if In|S(jw)|dw = 0 (5.7) 
0 
and the area of sensitivity reduction (In |.S| negative) must equal the area of 
sensitivity increase (In|S| positive). In this respect, the benefits and costs of 
feedback are balanced exactly, as in the waterbed analogy. From this we expect 
that an increase in the bandwidth (S smaller than 1 over a larger frequency 
range) must come at the expense of a larger peak in ||. Although this is true 
in most practical cases, the effect may not be so striking in some cases, and 
it is not strictly implied by 5.6 anyway. This is because the increase in area 
may come over an infinite frequency range; imagine a waterbed of infinite 
size. Consider |S(jw)| = 1+ 6 for w € [wi,w»], where 6 is arbitrarily small 
(small peak), then we can choose w arbitrary large (high bandwidth) simply 
by selecting the interval [w1,w] to be sufficiently large. However, in practice 
the frequency response of L has to roll off at high frequencies so wz is limited, 
and (5.6) and (5.7) impose real design limitations. 

Unstable plant. The presence of unstable poles usually increases the peak 
of the sensitivity, as seen from the positive contribution zr - ie Re(p;) in 
(5.6). Specifically, the area of sensitivity increase (|S| > 1) exceeds that of 
sensitivity reduction by an amount proportional to the sum of the distance 
from the unstable poles to the left-half plane. This is plausible since we might 
expect to have to pay a price for stabilizing the system. 


RHP-zeros: Second waterbed formula 


For plants with RHP-zeros the sensitivity function must satisfy an additional 
integral relationship, which has stronger implications for the peak of S. Before 
stating the result, let us illustrate why the presence of a RHP-zero implies 
that the peak of S must exceed one. First, consider the nonminimum phase 
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Alm 


Figure 5.3: Additional phase lag contributed by RHP-zero causes |S| > 1 


loop transfer function L(s) = Gin and its minimum phase counterpart 
Lm(s) = ewe From Figure 5.3 we see that the additional phase lag contributed 
by the RHP-zero and the extra pole causes the Nyquist plot to penetrate the 
unit circle and hence causes the sensitivity function to be larger than one. 
As a further example, consider Figure 5.4 which shows the magnitude of 
the sensitivity function for the following loop transfer function 
cj = 
s24+5 


k =0.1,0.5, 1.0, 2.0 (5.8) 


The plant has a RHP-zero z = 2, and we see that an increase in the controller 
gain k, corresponding to a higher bandwidth, results in a larger peak for S. 
For k = 2 the closed-loop system becomes unstable with two poles on the 
imaginary axis, and the peak of S is infinite. 


Theorem 5.3 Weighted sensitivity integral. Suppose that L(s) has a 
single real RHP-zero z or a complex conjugate pair of zeros z= x+ jy, and 
has N, RHP-poles, p;. Let p; denote the compler conjugate of pi. Then for 
closed-loop stability the sensitivity function must satisfy 


N, 
m : 7 |pit2 
In|S(jw)|- w(z,w)dw = 7- In — 5.9 
[1841 -w(e.) il eer (69) 
where if the zero is real 
2 2 1 
w(zZ,w) = es (5.10) 


ztw2 214 (w/z) 
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k=2.0 | 
(unstable) 


Magnitude |,5| 
> 
| 
oO 


10” 10° 10 10! 
Frequency 


Figure 5.4: Effect of increased controller gain on |,S| for system with RHP-zero at 


z=2, L(s)= ki 


and if the zero pair is compler (z =a + jy) 


w(z,w) = (5.11) 


Proof: See Freudenberg and Looze (1985; 1988). 


a Magnitude 
(log) 


VIA ee 
/ 


Figure 5.5: Plot of weight w(z,w) for case with real zero at s = z 


As shown graphically in Figure 5.5, the weight w(z,w) effectively “cuts off” 
the contribution of In|S| to the sensitivity integral at frequencies w > z. Thus, 
for a stable plant where |S| is reasonably close to 1 at high frequencies we 
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have the approximate relationship 
ze 
[ In|S(jw)|dw #0 (for |S| #1 at w> z) (5.12) 
0 


This is similar to Bode’s sensitivity integral relationship in (5.7), except that 
the trade-off between S less than 1 and S larger than 1, is done over a limited 
frequency range. Thus, in this case the waterbed is finite, and a large peak for 
|S| is unavoidable if we try to reduce |.S| at low frequencies. This is illustrated 
by the example in Figure 5.4. 

Note that when there is a RHP-pole close to the RHP-zero (p; — z) then 
Pit® _, o9, This is not surprising as such plants are in practice impossible to 
Di-—z 
stabilize. 


Exercise 5.1 Kalman inequality The Kalman inequality for optimal state 
feedback, which also applies to unstable plants, says that |S| <1,Vw, see Example 9.1. 
Explain why this does not conflict with the above sensitivity integrals. (Solution: 1. 
Optimal control with state feedback yields a loop transfer function with a pole-zero 
excess of one so (6.6) does not apply. 2. There are no RHP-zeros when all states are 
measured so (5.9) does not apply). 


The two sensitivity integrals (waterbed formulas) presented above are 
interesting and provide valuable insights, but for a quantitative analysis of 
achievable performance they are less useful. Fortunately, however, we can 
derive lower bounds on the weighted sensitivity and weighted complementary 
sensitivity, see (5.21), which are more useful for analyzing the effects of 
RHP-zeros and RHP-poles. The basis for these bounds are the interpolation 
constraints which we discuss first. 


5.3.3 Interpolation constraints 


If p is a RHP-pole of the loop transfer function L(s) then 


T(p)=1, S(p) =0 (5.13) 
Similarly, if z is a RHP-zero of L(s) then 
T(z)=0, S(z)=1 (5.14) 


These interpolation constraints follow from the requirement of internal 
stability as shown in (4.86) and (4.87). These conditions clearly restrict the 
allowable S and T and prove very useful in the next subsection. 


5.3.4 Sensitivity peaks 


In Theorem 5.3, we found that a RHP-zero implies that a peak in |S| is 
inevitable, and that the peak will increase if we reduce |S| at other frequencies. 
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Here we will derive explicit bounds on the weighted peak of S$, which are more 
useful in applications than the integral relationship. Such lower bounds to 
sensitivity were were originally derived by Zames (1981). We present below 
results for three cases: 


1. Bound on weighted sensitivity for plant with RHP-zero. 
2. Bound on weighted complementary sensitivity for plant with RHP-pole. 
3. Bounds for plant with both RHP-zero and RHP-pole. 


The results are based on the interpolation constraints S(z) = 1 and T(p) = 
given above. In addition, we make use of maximum modulus principle for 
complex analytic functions ( e.g., see maximum principle in Churchill et al. 
(1974)) which for our purposes can be stated as follows: 

Maximum modulus principle. Suppose f(s) is stable (i.e. f(s) 
analytic in the compler RHP). Then the mazimum value of |f(s)| for s in 
the right half plane is attained on the region’s boundary, t.e. somewhere along 
the jw-axis, Hence, we have for a stable f(s) 


Il FG) loo = max |f(jw)] 2 |f(so)| Vso0e RHP (5.15) 


Remark. (5.15) can be understood by imagining a 3-D plot of |f(s)| as a function 
of the complex variable s. In such a plot |f(s)| has “peaks” at its poles and “valleys” 
at its zeros. Thus, if f(s) has no poles (peaks) in the RHP, we find that | f(s)| slopes 
downwards from the LHP and into the RHP. 


To derive the results below we first consider f(s) = wp(s)S(s) (weighted 
sensitivity) and then f(s) = wr(s)T(s) (weighted complementary sensitivity). 
The weights are included to make the results more general, and we may select 
wp(s) =1and wr(s) = 1. 


Theorem 5.4 Weighted sensitivity peak. Suppose that G(s) has a RHP- 
zero z and let wp(s) be any stable weight function. Then for closed-loop 
stability the weighted sensitivity function must satisfy 


|wPS|loo 2 |wr(z)| (5.16 


Proof: Applying (5.15) to f(s) = wp(s)S(s) and using the interpolation constraint 
S(z) = 1, gives ||wpS|loo > |wp(z)S(z)| = |wP(2)]- 


Note that wp(s) = 1 in (5.16) yields the requirement ||S||,. > 1 which we 
already know must hold, and which in any case is satisfied for any real system 
since |S(jw)| must approach 1 at high frequencies. However, many useful 
relationships are derived by making other choices for the weight wp(s). This 
is discussed later in this chapter. 
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The following result is similar to Theorem 5.4, but it involves the 
complementary sensitivity T, rather than S. The basis for the result is that 
if G(s) has a RHP-pole at s = p, then for internal stability S(p) must have a 
RHP-zero at s = p and from (5.13) we have T(p) = 


Theorem 5.5 Weighted complementary sensitivity peak. Suppose that 
G(s) has a RHP-pole p and let wr(s) be any stable weight function. Then 
for closed-loop stability the weighted complementary sensitivity function must 
satisfy 


[wrT loo 2 |wr(p)| (5.17) 
Proof: Applying the maximum modulus principle in (5.15) to f(s) = wr(s)T(s) and 
using the interpolation constraint T(p) = 1 gives ||wrT||.0 > |wr(p)T(p)| = |wr(p)|. 


Consider wr(s) = 1 which yields the requirement ||T||.. > 1, and illustrates 
that some control is needed to stabilize an unstable plant (since no control, 
K =0, makes T = 0). 

The following theorem provides a generalization of Theorems 5.4 and 
thmwtt. 


Theorem 5.6 Combined RHP-poles and RHP-zeros. Suppose that 
G(s) has N, RHP-zeros z;, and has N, RHP-poles p;. Then for closed-loop 
stability the weighted sensitivity function must satisfy for each RHP-zero z; 


Np _ 
|z3 + pil 
lWpSlloo = erglwp(z;)|, ag = Il esa =] (5.18) 
ms 


and the weighted complementary sensitivity function must satisfy for each 
RHP-pole p; 


ae Dy 
|lwrT|loo 2 c2ilwr(pi)|, Cat = TT 2! ij +l 5 4 (5.19) 


ja 73 — Pal 


Proof: The basis for the theorem is a “trick” where we first factor out RHP-zeros 
in S or T into an all-pass part (with magnitude 1 at all points on the jw-axis). 
Consider first S. Since G have RHP-poles at pi, S(s) has RHP-zeros at p; and we 
may write 


S— pi 
= Yam a. = 2 
S=SaSm, Sa(s) lire (5.20) 
where, since Sa(s) is all-pass, |Sa(jw)| = 1 at all frequencies. (Remark: There is 


a technical problem here with jw-axis poles; these must first be moved slightly 
into the RHP). Consider a RHP-zero located at z, for which we get from the 
maximum modulus principle ||w,S||. = max, |wpS(jw)| = max, |weSm(jw)| > 


LIMITATIONS ON PERFORMANCE IN SISO SYSTEMS 183 


|Wp(z)Sm(z)|, where since S(z) = 1 we get Sm(z) = 1/S.(z) = c1. This proves 
5.18). 

: ee consider T. G has RHP-zeros at z;, and therefore 7 must have RHP-zeros 
at z;, so write T = TaTm with Ta = Il; ee 
problem here with jw-axis zeros; these must first be moved slightly into the RHP). 
Consider a RHP-pole located at p. Since |T.(jw)| = 1 at all frequencies, we get from 
the maximum modulus principle ||wrT||.. = max, |wrT(jw)|maxe |wrTm(jw)| > 
|wr(p)Tm(p)|, where since T(p) = 1 we get Tm(p) = 1/Ta(p) = cz. This proves 
(5.19). 


Notice that (5.16) is a special case of (5.18) with c; = 1, and (5.17) isa 
special case of (5.19) with co = 1. Also, if we select wp = wr = 1, we derive 
the following useful bounds on the peaks of S and T: 


(Remark: There is a technical 


lSlloo > MAX C1, IT loo = Max Cj (5.21) 


This shows that large peaks for S and T are unavoidable if we have a RHP- 
zero and RHP-pole located close to each other. This is illustrated by examples 
in Section 5.9. 

These bounds may be generalized to MIMO systems if the directions of 
poles and zeros are taken into account, see Chapter 6. 


5.4 Ideal ISE optimal control 


Another good way of obtaining insight into performance limitations, is to 
consider an “ideal” controller which is integral square error (ISE) optimal. 
That is, for a given command r(t) (which is zero for t < 0), the “ideal” 
controller is the one that generates the plant input u(t) (zero for t < 0) which 
minimizes 


ISE ae ly(t) — r(t) Pat (5.22) 


This controller is “ideal” in the sense that it may not be realizable in practice 
because the cost function includes no penalty on the input u(t). This particular 
problem is considered in detail by Frank (1968a, 1968b) and Morari and 
Zafiriou (1989), and also Qiu and Davison (1993) who study “cheap” LQR 
control. Morari and Zafiriou show that for stable plants with RHP-zeros at z; 
(real or complex) and a time delay 6, the “ideal” response y = T’r when r(t) 
is a unit step is given by 


“sae mney, (5.23) 


8+ 2; 


where Z; is the complex conjugate of z;. The ideal T(s) is “all-pass” with 
|T(jw)| = 1 at all frequencies. the result in (5.23) is derived by considering 
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an “open-loop” optimization problem, and applies to feedback as well as 
feedback control. Note in the feedback case that the ideal sensitivity function 
is |S(jw)| = |L7-1(jw)T (Gw)| = 1/|L(jw)| at all frequencies. 

The corresponding optimal value of the cost function is for three simple 
cases 


Delay 6: ISE = 6 (5.24) 
Real RHP — zero z : ISE = 2/z (5.25) 
Complex RHP — zeros z =a2+jy: ISE=42/(a4?+y?) (5.26) 


This quantifies nicely the limitations imposed by non-minimum phase 
behaviour, and the implications in terms if the achievable bandwidth are 
considered below. 

If r(t) is not a step then other expressions for T rather than that in (5.23) 
are derived; see Morari and Zafiriou (1989) for details. 


5.5 Limitations imposed by time delays 


7 im fl 
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Figure 5.6: “Ideal” sensitivity function (5.27) for a plant with delay. 


Consider a plant G(s) that contains a time delay e~°* (and no RHP-zeros). 
Even the “ideal” controller cannot remove this delay. For a step change in the 
reference r(t), we have to wait the a time 6 until perfect control is achieved. 
Thus, as shown in (5.23), the “ideal” complementary sensitivity function will 
be T = e—"*. The corresponding “ideal” sensitivity function is 


S=1-T=1-e% (5.27) 


The magnitude |S| is plotted in Figure 5.6. At low frequencies, w# < 1, we 
have 1 — e—° = @s (by a Taylor series expansion of the exponential) and the 
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low-frequency asymptote of |S(jw)| crosses 1 at a frequency of about 1/6 (the 
exact frequency where |S(jw)| crosses 1 in Figure 5.6 is 75 = 1.05/@). Since 
in this case |S| = 1/|L]|, we also have that 1/0 is equal to the gain crossover 
frequency for L. In practice, the “ideal” controller cannot be realized, so we 


expect this value to provide an approximate upper bound on w-, namely 
we < 1/0 (5.28) 


This approximate bound is the same as derived in Section 2.6.2 by considering 
the limitations imposed on a loop-shaping design by a time delay 0. 


5.6 Limitations imposed by RHP-zeros 


We here consider plants with a zero z in the closed right-half plane (and no 
pure time delay). In the following we attempt to build up insight into the 
performance limitations imposed by RHP-zeros using a number of different 
results in both the time and frequency domains. 

RHP-zeros typically appear when we have competing effects of slow and 
fast dynamics. For example, the plant 


1 25, —s+8 
s+1 s+10  (s+1)(s+10) 


G(s) = 


has a real RHP-zero at z = 8. We may also have complex zeros, and since 
these always occur in complex conjugate pairs we have z = r+jy where x > 0 
for RHP-zeros. 


5.6.1 Inverse response 


For a stable plant with n, RHP-zeros, it may be proven (Holt and Morari, 
1985b; Rosenbrock, 1970) that the output in response to a step change 
in the input will cross zero (its original value) n, times, that is, we have 
inverse response behaviour. A typical response for the case with one RHP- 
zero is shown in Figure 2.14, 41. We see that the output initially decreases 
before increasing to its positive steady-state value. With two RHP-zeros the 
output will initially increase, then decrease below its original value, and finally 
increase to its positive steady-state value. 


5.6.2 High-gain instability 


It is well-known from classical root-locus analysis that as the feedback gain 
increases towards infinity the closed-loop poles migrate to the positions of 
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the open-loop zeros; also see (4.79). Thus, the presence of RHP-zeros implies 
high-gain instability. 


5.6.3. Bandwidth limitation I 


For a step change in the reference we have from (5.23) that the “ideal” 
complementary sensitivity function is all-pass, and for a single real RHP-zero 
the “ideal” sensitivity function is 
28 
S= (5.29) 
S+2 

The Bode magnitude plot of |S| (= 1/|L|) is shown in Figure 5.7(a). The low- 
frequency asymptote of |S(jw)| crosses 1 at the frequency z/2. In practice, 
the “ideal” ISE optimal controller cannot be realized, and we derive (for a 
real RHP-zero) the approximate requirement 


WER We < 5 (5.30) 
which we also derived in Section 2.6.2 using loop-shaping arguments. 
For a complex pair of RHP-zeros, z = x+jy, we get from (5.23) the “ideal” 
sensitivity function 


a Ars 
(statjy)(s+2— jy) 
In Figure 5.7(b) we plot |S| for y/a equal to 0.1, 1, 10 and 50. An analysis of 
(5.31) and the figure yields the following approximate bounds 


(5.31) 


|z|/4  Re(z) > Im(z) 
WBRRWe << |2z|/2.8 Re(z) = Im(z) (5.32) 
lz Re(z) « Im(z) 


In summary, RHP-zeros located close to the origin (with |z| small) are bad 
for control, and it is worse for them to be located closer to the real axis then 
the imaginary axis. 


Remark. For a complex pair of zeros, z = x + jy, we notice from (5.31) and 
Figure 5.7 that the resonance peak for S at w % y becomes increasingly “thin” as 
the zero approaches the imaginary axis (« — 0). Thus, for a zero located at the 
imaginary axis (x = 0) the ideal sensitivity function is zero at all frequencies, except 
for a single “spike” at w = y where it jumps up to 2. The integral under the curve 
for |S(jw)|? thus approaches zero, as does the ideal ISE-value in response to a step 
in the reference, see also (5.26). This indicates that purely imaginary zeros do not 
always impose limitations. This is also confirmed by see the flexible structure in 
Example 2.10, for which the response to an input disturbance is satisfactory, even 
though the plant has a pair of imaginary zeros. However, the flexible structure is 


LIMITATIONS ON PERFORMANCE IN SISO SYSTEMS 


— 
T 


Magnitude [S| 
S 


2/2 


vl 0 


Frequency x 1/z 
(a) Real RHP-zero. 


Magnitude || 


y/x=0.1 y/z=10 y/#=50 


Figure 5.7: 


Frequency x 1/a 


(b) Complex pair of RHP-zeros, z = # + 


Jy: 


“Ideal” sensitivity functions for plants with RHP-zeros 


187 


188 MULTIVARIABLE FEEDBACK CONTROL 


a rather special case where the plant also has imaginary poles which counteracts 
most of the effect of the imaginary zeros. Therefore, in other cases, the presence of 
imaginary zeros may limit achievable performance, for example, in the presence of 
sinusoidal disturbances with frequency close to y, or with uncertainty which makes 
it difficult to place poles in the controller to counteract the imaginary zero. 


5.6.4 Bandwidth limitation IT 


Another way of deriving bandwidth limitations is from the bound (5.16) on 
weighted sensitivity given in Theorem 5.4. The idea is to select a form for 
the performance weight wp(s), and then derive a bound for the “bandwidth 
parameter” in the weight. 

As usual, we select 1/|wp| as an upper bound on the sensitivity function 
(see Figure 5.1 on page 175), that is, we require 


IS(jw)| <1/|we(w)|,ve = |lweSllo <1 (5.33) 


However, from (5.16) we have that ||wpS||.. > |wp(z)|, so to be able to satisfy 
(5.33) we must at least require that the weight satisfies 


jwp(z)| <1 (5.34) 


(We say “at least” because condition (5.16) is not an equality). We will now 
use (5.34) to gain insight into the limitations imposed by RHP-zeros; first 
by considering a weight that requires good performance at low frequencies, 
and then by considering a weight that requires good performance at high 
frequencies. 


Performance at low frequencies 
Consider the following performance weight 


8/M + wy 


stwyA ey 


wp(s) = 
This is the same weight as in (2.73). From (5.33) it specifies a minimum 
bandwidth w% (actually, w% is the frequency where the straight-line 
approximation of the weight crosses 1), a maximum peak of |S| less than 
M, a steady-state offset less than A < 1, and at frequencies lower than the 
bandwidth the sensitivity is required to improve by at least 20 dB/decade 
(i.e. [S| has slope 1 or larger on a log-log plot). If the plant has a RHP-zero 
ats = z, then from (5.34) we must require 


<1 (5.36) 
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Real zero. Consider the case when z is real. Then all variables are real 
and positive and (5.36) is equivalent to 


wh(1— A) <2 (1 _ iz) (5.37) 


For example, with A = 0 (no steady-state offset) and M = 2 (||S|loo < 2) 
we must at least require wy < 0.5z, which is consistent with the requirement 
wp <0.5z in (5.30). 

Imaginary zero. For a RHP-zero on the imaginary axis, z = j|z|, a similar 
derivation yields with A = 0: 


/ 1 


For example, with M = 2 we require w% < 0.86|z|, which is very similar to 
the requirement wy < |z| given in (5.32). peak The next two exercises show 
that the bound on wy does not depend much on the slope of the weight at 
low frequencies, or on how the weight behaves at high frequencies. 


Exercise 5.2 Consider the weight 


s+ Mw, s+ fMwe, 
8 s+ fMews, 


wp(s) = (5.39) 


with f >1. This is the same weight as (5.85) with A =0 except that it approaches 
1 at high frequencies, and f gives the frequency range over which we allow a peak. 
Plot the weight for f =10 and M = 2. Derive an upper bound on wy for the case 
with f =10 and M = 2. 


Exercise 5.3 Consider the weight wp(s) = 47+ (2B) which requires |S| to have 
a slope of n at low frequencies and requires its low-frequency asymptote to cross 1 
at a frequency wz. Note that n = 1 yields the weight (5.85) with A = 0. Derive 
an upper bound on wR when the plant has a RHP-zero at z. Show that the bound 
becomes wz < |z| as n > oo. 


Remark. The result for n — oo in exercise 5.3 is a bit surprising. It says that the 
bound w% < |z|, is independent of the required slope (n) at low frequency and is also 
independent of M. This is surprising since from Bode’s integral relationship (5.6) 
we expect to pay something for having the sensitivity smaller at low frequencies, 
so we would expect wz to be smaller for larger n. This illustrates that |wp(z)| < 1 
in (5.34) is a necessary condition on the weight (i.e. it must at least satisfy this 
condition), but since it is not sufficient it can be optimistic. For the simple weight 
(5.35), with n = 1, condition (5.34) is not very optimistic (as is confirmed by other 
results), but apparently it is optimistic for n large. 
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Performance at high frequencies 


The bounds (5.37) and (5.38) derived above assume tight control at low 
frequencies. Here, we consider a case where we want tight control at high 
frequencies, by use of the performance weight 

1 8 


wp(s) = i + Dp (5.40) 


This requires tight control (|S(jw)| < 1) at frequencies higher than w%, 
whereas the only requirement at low frequencies is that the peak of |S| is 
less than M. Admittedly, the weight in (5.40) is unrealistic in that it requires 
S — 0 at high frequencies, but this does not affect the result as is confirmed 
in Exercise 5.5 where a more realistic weight is studied. In any case, to satisfy 
|weS|loo < 1 we must at least require that the weight satisfies |wp(z)| < 1, 
and with a real RHP-zero we derive for the weight in (5.40) 


WR >z (5.41) 


1-1/M 


For example, with M = 2 the requirement is wy > 2z, so we can only achieve 
tight control at frequencies beyond the frequency of the RHP-zero. 


Exercise 5.4 Make an asymptotic magnitude Bode-plot of wp(s) in (5.40). 


Exercise 5.5 Consider the case of a plant with a RHP zero where we want to limit 

the sensitivity function over some frequency range. To this effect let 

(1000s/w + 7) (s/(Mw3) + 1) 
(10s/w% + 1)(100s/w% +1) 


wp(s) = (5.42) 


This weight is equal to 1/M at low and high frequencies, has a mazimum value of 
about 10/M at intermediate frequencies, and the asymptote crosses 1 at frequencies 
wp /1000 and wy. Thus we require “tight” control, |S| <1, in the frequency range 
between wer, =wR/1000 andwey = we. 

a) Make a sketch of 1/|wp| (which provides an upper bound on |S\). 

b) Show that the RHP-zero cannot be in the frequency range where we require tight 
control, and that we can achieve tight control either at frequencies below about z/2 
(the usual case) or above about 2z. To see this select M = 2 and evaluate wp(z) for 
various values of we = kz, e.g., k =.1,.5,1, 10, 100, 1000, 2000, 10000. (You will find 
that wp(z) =0.95 (& 1) fork =0.5 (corresponding to the requirement way < z/2) 
and for k = 2000 (corresponding to the requirement wer > 2z)) 


5.6.5 Limitations at low or high frequencies 


Based on (5.37) and (5.41) we see that a RHP-zero will pose control limitations 
either at low or high frequencies. In most cases we desire tight control at low 
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frequencies, and with a RHP-zero this may be achieved at frequencies lower 
than about 2/2. However, if we do not need tight control at low frequencies, 
then we may usually reverse the sign of the controller gain, and instead achieve 
tight control at frequencies higher than about 2z. 


Remark. The reversal of the sign in the controller is probably best understood by 
considering the inverse response behaviour of a plant with a RHP-zero. Normally, 
we want tight control at low frequencies, and the sign of the controller is based 
on the steady-state gain of the plant. However, if we instead want tight control at 
high frequencies (and have no requirements at low frequencies) then we base the 
controller design on the plants initial response where the gain is reversed because of 
the inverse response. 


Example 5.1 To illustrate this, consider in Figures 5.8 and 5.9 the use of negative 
and positive feedback for the plant 

SAS 
oe 
More precisely, we show in the figures the sensitivity function and the time response 
to a step change in the reference using 


1. Pl-control with negative feedback (Figure 5.8) 
2. derivative control with positive feedback (Figure 5.9). 


G(s) = 


Note that the time scales for the simulations are different. For positive feedback 
the step change in reference only has a duration of 0.1 s, because we cannot track 
references over much longer times as the RHP-zero causes the the output to start 
drifting away (as can be seen in Figure 5.9 (b)). 


Setpoint 


Magnitude || 


10” 10” 10" Ot” = sae oa 
Frequency [rad/s] Time 
(a) Sensitivity function. (b) Response to step in reference. 


Figure 5.8: Control of plant with RHP-zero at z = 1 using negative feedback 


G(s) = at Ki(s) = Ke st Mca 


Note that G(s) & 1 at low frequencies (vw < z), whereas G(s) % —1 at high 
frequencies (w >> z). The negative plant gain in the latter case explains why we then 
use positive feedback in order to achieve tight control at high frequencies. 
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Setpoint 
! K.=0.9 
ca 
5 
= 0.5 
a=] a 
a0 
5 
0 
10° 10° 10° 0 0.05 0.1 
Frequency [rad/s] Time 
(a) Sensitivity function. (b) Response to step in reference. 


Figure 5.9: Control of plant with RHP-zero at z = 1 using positive feedback. 


G(s) = 4, Ko(s) = —Ke 


s 
(0.05s+1)(0.02s+1) 


Short-term control. In this book, we generally assume that the system 
behaviour as t > co is important. However, this is not true in some cases 
because the system may only be under closed-loop control for a finite time t,. 
In which case, the presence of a “slow” RHP-zero (with |z| small), may not 
be significant provided ty < 1/|z|. For example, in Figure 5.9 (b) if the total 
control time is ty = 0.01[s], then the RHP-zero at z = 1[rad/s] is insignificant. 


Remark 1 As a example of short-term control, consider treating a patient with 
some medication. Let u be the dosage of medication and y the condition of the 
patient. With most medications we find that in the short-term the treatment has 
a positive effect, whereas in the long-term the treatment has a negative effect (due 
to side effects which may eventually lead to death. However, this inverse-response 
behaviour (characteristic of a plant with a RHP-zero) may be largely neglected 
during limited treatment, although one may find that the dosage has to be increased 
during the treatment to have the desired effect. Interestingly, the last point is 
illustrated by the upper left curve in Figure 5.10, which shows the input u(t) using 
an internally unstable controller which over some finite time may eliminate the effect 
of the RHP-zero. 


Remark 2 An important case, where we can only achieve tight control at high 
frequencies, is characterized by plants with a zero at the origin, for example 
G(s) = s/(5s+1). In this case, good transient control is possible, but the control has 
no effect at steady-state. The only way to achieve tight control at low frequencies is 
to use an additional actuator (input) as is often done in practice. 


Exercise 5.6 In the simulations in Figures 5.8 and 5.9, we use simple PI- and 
derivative controllers. As an alternative use the S/KS method in (3.55) to synthesize 
Heo controllers for both the negative and positive feedback cases. Use performance 
weights in the form given by (5.35) and (5.40), respectively. With wp = 1000 and 
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M = 2 in (6.40) and wy = 1 (for the weight on KS) you will find that the time 
response is quite similar to that in Figure 5.9 with Ke = 0.5. Try to improve the 
response, for example, by letting the weight have a steeper slope at the crossover near 
the RHP-zero. 


5.6.6 Remarks on the effects of RHP-zeros 


1. The derived bound w. < z/2 for the case of tight control at low frequencies is 
consistent with the bound w, < 1/0 found earlier for a time delay. This is seen 
from the following Padé approximation of a delay, e~°° & (1— $s)/(1 + $s), 
which has a RHP-zero at 2/0. 

2. LHP-zero. Zeros in the left-half plane, usually corresponding to “overshoots” 
in the time response, do not present a fundamental limitation on control, but 
in practice a LHP-zero located close to the origin may cause problems. First, 
one may encounter problems with input constraints at low frequencies (because 
the steady-state gain maybe too low). Second, a simple controller can probably 
not then be used. For example, a simple PID controller as in (5.69) contains no 
adjustable poles that can be used to counteract the effect of a LHP-zero. 

3. For uncertain plants, zeros can cross from the LHP to the RHP both through 
zero and through infinity. We discuss this in Chapter 7. 


5.7 Non-causal controllers 


Perfect control can be achieved for a plant with a time delay or RHP-zero if we 
use a non-causal controller’, i.e. a controller which uses information about the 
future. This may be relevant for certain servo problems, e.g. in robotics and 
batch processing. A brief discussion is given here, but non-causal controllers 
are not considered in the rest of the book because future information is usually 
not available. 

Time delay. For a delay e-’* we may achieve perfect control with a non- 
causal feedforward controller K, = e°* (a prediction). Such a controller may 
be used if we have knowledge about future changes in r(t) or d(t). For example, 
if we know that we should be at work at 8:00, and we know that it takes 30 
min to get to work, then we make a prediction and leave home at 7:30. 

RHP-zero. Future knowledge can also be used to give perfect control in 
the presence of a RHP-zero. As an example, consider a plant with a real 
RHP-zero given by 


Os 


—S He. 
stz’ 


G(s) = z>0 (5.44) 


TA system is causal if its outputs depends only on past inputs, and non-causal if its outputs 
also depends on future inputs. 
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and a desired reference change 
0 t<0 
rays { L450 


With a feedforward controller K; the response from r to y is y = G(s) K;(s)r. 
In theory we may achieve perfect control (y(t)=r(t)) with the following two 
controllers (Eaton and Rawlings, 1992). 


1. A causal unstable controller 


Soe 
Bb 


K;,(s) = 


For a step in r from 0 to 1 at t = 0, this controller generates the following 
input signal 

0 t<0 

1—2e* 4t>0 


However, since the controller cancels the RHP-zero in the plant it yields 
an internally unstable system. 

2. A stable non-causal controller that assumes that the future setpoint change 
is known. This controller cannot be represented in the usual transfer 
function form, but it will generate the following input 


2e7! t<0 
wt) = { 1 +#>0 


These input signals u(t) and the corresponding outputs y(t) are shown in 
Figure 5.10 for a plant with z = 1. Note that for perfect control the non-causal 
controller needs to start changing the input at t = —oo, but for practical 
reasons we started the simulation at t = —5 where u(t) = 2e-° = 0.018. 

The first option, the unstable controller, is not acceptable as it yields an 
internally unstable system in which u(t) goes to infinity as t increases (an 
exception may be if we want to control the system only over a limited time 
tr; see Remark 5.6.5 on page 192). 

The second option, the non-causal controller, is usually not possible because 
future setpoint changes are unknown. However, if we have such information, 
it is certainly beneficial for plants with RHP-zeros. 

However, in most cases we have to accept the poor performance resulting 
from the RHP-zero and use a stable causal controller, the third option. The 
ideal causal feedforward controller in terms of minimizing the ISE (Hz norm) 
of y(t) for the plant in (5.44) is to use AK, = 1, and the corresponding plant 
input and output responses are shown in the lower plots in Figure 5.10. 
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Input: unstable controller 


Output: unstable controller 
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Figure 5.10: Feedforward control of plant with RHP-zero 


5.8 Limitations imposed by RHP-poles 


We here consider the limitations imposed when the plant has a RHP-pole 
(unstable pole) at s = p. For example, the plant G(s) = 1/(s—3) has a 
RHP-pole at s = 3. 

For unstable plants we need feedback for stabilization. Thus, whereas 
the presence of RHP-zeros usually places an upper bound on the allowed 
bandwidth, the presence of RHP-poles generally imposes a lower bound. It 
also follows that it may be more difficult to stabilize an unstable plant if there 
are RHP-zeros or a time delay present (“the system goes unstable before we 
have time to react”). These qualitative statements are quantified below where 
we derive bounds on the bandwidth in terms of T. 

We start by selecting a weight wr(s) such that 1/|w7| is a reasonable upper 
bound on the complementary sensitivity function. 


IT(jw)| <1/\wrGw)|, Veo << |lwrT loo <1 (5.45) 


However, from (5.17) we have that ||wrT lo. > |wr(p)|, so to be able to satisfy 
(5.45) we must at least require that the weight satisfies 


|wr(p)| <1 (5.46) 
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Magnitude 


Frequency [rad/s] 


Figure 5.11: Typical complementary sensitivity, |T], with upper bound 1/|w7| 


Now consider the following weight 


8 1 
wr(s) aa + My (5.47) 
which requires T (like ||) to have a roll-off rate of at least 1 at high frequencies 
(which must be satisfied for any real system), that |T| is less than M7 at low 
frequencies, and that |T| drops below 1 at frequency wi, The requirement 
on |7| is shown graphically in Figure 5.11. 
Real RHP-pole at s = p. For the weight (5.47) condition (5.46) yields 


Mr 


ad A 
My (5.48) 


Wap > P 

Thus, the presence of the RHP-pole puts a lower limit on the bandwidth in 

terms of T; that is, we cannot let the system roll-off at frequencies lower 

than p. For example, with My = 2 we get why > 2p, which is approximately 
achieved if 

We > 2p (5.49) 


Imaginary RHP-pole. For a purely imaginary pole located at p = j|p| 
a similar analysis of the weight (5.47) with Mr = 2, shows that we must at 
least require w%7 > 1.15|p|, which is achieved if 


we > 1.15|p| (5.50) 


In conclusion, we find that stabilization with reasonable performance requires 
a bandwidth which is larger than the distance |p| of the RHP-pole from the 
origin. 


Exercise 5.7 Derive the bound in (5.50). 
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5.9 Combined RHP-poles and RHP-zeros 


Performance. We have above considered separately the effects of RHP-zeros 
and RHP-poles. In order to get acceptable performance and robustness, we 
derived for a real RHP-zero, the approximate bound w, < z/2 (assuming we 
want tight control below this frequency); and for a real RHP-pole, we derived 
the approximate bound w, > 2p. From these we get that for a system with a 
single real RHP-pole and a RHP-zero we must approximately require z > 4p 
in order to get acceptable performance and robustness. 

Stabilization. In theory, any linear plant may be stabilized irrespective of 
the location of its RHP-poles and RHP-zeros. However, for practical purposes 
it is sometimes desirable that the controller itself is stable. If such a controller 
exists the plant is said to be strongly stabilizable. It has been proved by Youla 
et al. (1974) that a strictly proper plant is strongly stabilizable if and only if 
every real RHP-zero in G(s) lies to the left of an even number (including zero) 
of real RHP-poles in G(s). Note that the presence of any complex RHP-poles 
or complex RHP-zeros does not affect this result. We then have: 


e A strictly proper plant with a single real RHP-zero z and a single real RHP- 
pole p, e.g., G(s) = (Carereane can be stabilized by a stable controller if 
and only if z > p 


This shows that for plants with a single RHP-pole and RHP-zero our 
approximate performance requirement z > 4p is quite close to the strong 
stabilizability requirement z > p. 

The following example for a plant with z = 4p shows that we can indeed get 
acceptable performance when the RHP-pole and zero are located this close. 


Example 5.2 H. design for plant with RHP-pole and RHP-zero. We 
want to design an Ho controller for a plant with z= 4 and p=1, 


s—4 
(s —1)(0.1s + 1) 


We use the S/KS design method as in Example 2.11 with input weight W, = 1 
and performance weight (5.85) with A=0, M = 2, wR = 1. The software gives a 
stable and minimum phase controller with an Ho norm of 1.89. The corresponding 
sensitivity and complementary sensitivity functions, and the time response to a untt 
step reference change are shown in Figure 5.12. The response is good, taking into 
account the closeness of the RHP-pole and zero. 


G(s) = (5.51) 


Sensitivity peaks In Theorem 5.6 we derived lower bounds on the 
weighted sensitivity and complementary sensitivity. For example, for a plant 
with a single real RHP-pole p and a single real RHP-zero z, we always have 


|S |loo = Cc, |T loo > C, e— lz +P 
lz — pl 


(5.52) 
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Figure 5.12: 7H. design of a plant with RHP-zero at z = 4 and RHP-pole at p = 1 


Example 5.2 continued. With z = 4p, (5.52) gives c = 5/3 = 1.67 and it follows 
from that for any controller we must at least have ||S||oo > 1.67 and ||T ||. > 1.67. 
The actual peak values for the above S/KS-design are 2.40 and 2.43, respectively. 


Example 5.3 Balancing a rod. This example is taken from Doyle et al. (1992) 
Consider the problem of balancing a rod in the palm of one’s hand. The objective 
is to keep the rod upright, by small hand movements, based on observing the rod 
either at its far end (output y1) or the end in one’s hand (output yo). The linearized 
transfer functions for the two cases are 


~9 ls? —g 


aie s? (Mls? — (M + m)g)’ Se s? (Mls? —(M+m)g) 


Here | [m] is the length of the rod and m [kg] its mass. M [kg] is the mass of your 
hand and g [= 10 m/s] is the acceleration due to gravity. In both cases, the plant 


has three unstable poles: two at the origin and one at p = 4/ ee A short rod 
with a large mass gives a large value of p, and this in turn means that the system is 
more difficult to stabilize. For example, with M =m andl =1 [m] we get p= 4.5 
[rad/s] and from (5.49) we desire a bandwidth of about 9 [rad/s] (response time less 
than about 0.1 [s/). 

If one is measuring yi (looking at the far end of the rod) then achieving this 
bandwidth is the main requirement. However, if one tries to balance the rod by looking 
at one’s hand (y2) there is also a RHP-zero at z = V3. If the mass of the rod is 
small (m/M is small), then p is close to z and stabilization is in practice impossible 
with any controller. However, even with a large mass, stabilization is very difficult 
because p > z whereas we would normally prefer to have the RHP-zero far from the 
origin and the RHP-pole close to the origin (z > p). So although in theory the rod 
may be stabilized by looking at one’s hand (G2), it seems doubtful that this is possible 
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for ahuman. To quantify these problems use (5.52). We get 


paket [tal | [Mem 
le—pl [P41 M 
Consider a light weight rod with m/M = 0.1, for which we expect stabilization to be 
difficult. We obtain c = 42, and we must have ||S||o. > 42 and ||T||oo > 42, so poor 
control performance is inevitable if we try to balance the rod by looking at our hand 
(yo). 


The difference between the two cases, measuring yi: and measuring yo, highlights 
the importance of sensor location on the achievable performance of control. 


5.10 Performance requirements imposed by 
disturbances and commands 


The question we here want to answer is: How fast must the control system be 
in order to reject disturbances and track commands of a given magnitude? We 
find that some plants have better “built-in” disturbance rejection capabilities 
than others. This may be analyzed directly by considering the appropriately 
scaled disturbance model, Ga(s). Similarly, for tracking we may consider the 
magitude R of the reference change. 

Disturbance rejection. Consider a single disturbance d and assume 
that the reference is constant, i.e., r = 0. Without control the steady-state 
sinusoidal response is e(w) = Gg(jw)d(w); recall (2.9). If the variables have 
been scaled as outlined in Section 1.4 then the worst-case disturbance at any 
frequency is d(t) = sinwt, ie., |d(w)| = 1, and the control objective is that 
at each frequency |e(t)| < 1, ie., |e(w)| < 1. From this we can immediately 
conclude that 


e no control is needed if |Ga(jw)| <1 at all frequencies (in which case the 
plant is said to be “self-regulated”). 


If |Ga(jw)| > 1 at some frequency, then we need control (feedforward or 
feedback). In the following, we consider feedback control, in which case we 
have 


e(s) = S(s)Ga(s)d(s) (5.53) 


The performance requirement |e(w)| < 1 for any |d(w)| < 1 at any frequency, 
is satisfied if and only if 


|SGa(jw)| <1 Vw <= ||SGallo <1 (5.54) 


= |S(jw)| < 1/|Ga(jw)| Vw (5.55) 
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A typical plot of 1/|Ga(jw)| is shown in Figure 5.13 (dotted line). If the plant 
has a RHP-zero at s = z, which fixes S(z) = 1, then using (5.16) we have the 
following necessary condition for satisfying ||SGa|loo. <1: 


IGa(z)| <1 (5.56) 


From (5.55) we also get that the frequency wg where |Gq| crosses 1 from above 
yields a lower bound on the bandwidth: 


wp >wa| where wy is defined by |Ga(jwa)| =1 (5.57) 


A plant with a small |G‘a| or a small wg is preferable since the need for feedback 
control is then less, or alternatively, given a feedback controller (which fixes 
S) the effect of disturbances on the output is less. 


Wd - 7 = 
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Figure 5.13: Typical performance requirement on S imposed by disturbance 
rejection 


Example 5.4 Assume that the disturbance model is Ga(s) = ka/(1+7as) where 
ka = 10 and ta = 100 [seconds]. Scaling has been applied to Ga so this means 
that without feedback, the effect of disturbances on the outputs at low frequencies is 
ka = 10 times larger than we desire. Thus feedback is required, and since |Gq| crosses 
1 at a frequency wa & ka/ta = 0.1 rad/s, the minimum bandwidth requirement for 
disturbance rejection is wp > 0.1 [rad/s]. 


Remark. Gy is of high order. The actual bandwidth requirement imposed by 
disturbances may be higher than wa if |Ga(jw)| drops with a slope steeper than —1 
(on a log-log plot) just before the frequency wa. The reason for this is that we must, 
in addition to satisfying (5.55), also ensure stability with reasonable margins; so as 
discussed in Section 2.6.2 we cannot let the slope of |L(jw)| around crossover be 
much larger than —1. 
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An example, in which G'q(s) is of high order, is given later in Section 5.16.3 for 
a neutralization process. There we actually overcome the limitation on the slope 
of |L(jw)| around crossover by using local feedback loops in series. We find that, 
although each loop has a slope —1 around crossover, the overall loop transfer function 
L(s) = [1(s)L2(s)---In(s) has a slope of about —n; see the example for more 
details. This is a case where stability is determined by each J + L; separately, but 
the benefits of feedback are determined by 1+]], Li (also see Horowitz (1991, p. 284) 
who refers to lectures by Bode). 


Command tracking. Assume there are no disturbances, i.e., d = 0, and 
consider a reference change r(t) = Rr(t) = Rsin(wt). Since e = Gu+Gad—Rr, 
the same performance requirement as found for disturbances, see (5.54), 
applies to command tracking with Gq replaced by —R. Thus for acceptable 
control (|e(w)| <1) we must have 


|S(jw)R| < 1,Vw < w, (5.58) 
where w, is the frequency up to which performance tracking is required. 


Remark. The bandwidth requirement imposed by (5.58) depends on on how sharply 
|S(jw)| increases in the frequency range from w, (where |S| < 1/R) to wz (where 
|S| & 1). If |S] increases with a slope of 1 then the approximate bandwidth 
requirement becomes wz > Rw,, and if |S| increases with a slope of 2 it becomes 


we > VRwr. 


5.11 Limitations imposed by input constraints 


In all physical systems there are limits to the changes that can be made to 
the manipulated variables. In this section, we assume that the model has been 
scaled as outlined in Section 1.4, so that at any time we must have |u(t)| < 1. 
The question we want to answer is: Can the expected disturbances be rejected 
and can we track the reference changes while maintaining |u(t)| <1? We will 
consider separately the two cases of perfect control (e = 0) and acceptable 
control ({e| < 1). These results apply to both feedback and feedforward 
control. 

At the end of the section we consider the additional problems encountered 
for unstable plants (where feedback control is required). 


Remark 1 We use a frequency-by-frequency analysis and assume that at each 
frequency |d(w)| < 1 (or |r(w)| < 1). The worst-case disturbance at each frequency 
is |d(w)| = 1 and the worst-case reference is r = RF with |r(w)| = 1. 


Remark 2 Note that rate limitations, |du/dt| < 1, may also be handled by our 
analysis. This is done by considering du/dt as the plant input by adding a term 1/s 
to the plant model G(s). 
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Remark 3 Below we require |u| < 1 rather than |u| < 1. This has no practical 
effect, and is used to simplify the writing. 


5.11.1 Inputs for perfect control 
From (5.3) the input required to achieve perfect control (e = 0) is 


u=G'r—-G'Gad (5.59) 

Disturbance rejection. With r = 0 and |d(w)| = 1 the requirement 
|u(jw)| < 1 is equivalent to 

G7! (jw)Ga(jw)| < 1 Ww (5.60) 


In other words, to achieve perfect control and avoid input saturation we need 
|G| > |Ga| at all frequencies. (However, as is discussed below, we do no really 
need control at frequencies where |Ga| < 1.) 

Command tracking. Next let d = 0 and consider the worst-case reference 
command which is |r(w)| = R at all frequencies up to w,. To keep the inputs 
within their constraints we must then require from (5.59) that 


IG! (jw)R) <1 Vo <u, (5.61) 


In other words, to avoid input saturation we need |G| > R at all frequencies 
where perfect command tracking is required. 


Example 5.5 Consider a process with 


40 BOs +1 
Grhnesriy 9) =3a yes 


G(s) = 


From Figure 5.14, we see that condition (5.60) is not satisfied for w > w,. However, 
for frequencies w > wa we do not really control. Thus, in practice, we expect that 
disturbances in the frequency range between w; and wa may cause input saturation. 


5.11.2 Inputs for acceptable control 


Above we assumed for simplicity perfect control. However, perfect control 
is never really required, especially not at high frequencies, and the input 
magnitude required for acceptable control (namely |e(jw)| < 1) is somewhat 
smaller. For disturbance rejection we must require 


|G| > |Ga| — 1] at frequencies where |Ga| > 1 (5.62) 
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Figure 5.14: Input saturation is expected for disturbances at intermediate 
frequencies from w to wa 


Proof: Consider a “worst-case” disturbance with |d(w)| = 1. The control error is 
e =y = Gu+Gad. Thus at frequencies where |Ga(jw)| > 1 the smallest input needed 
to reduce the error to |e(w)| = 1 is found when u(w) is chosen such that the complex 
vectors Gu and Gad have opposite directions. That is, |e] = 1 = |Gad| — |G'u|, and 
with |d| = 1 we get |u| = |G~'|(|Ga| —1), and the result follows by requiring |u| < 1. 


Similarly, to achieve acceptable control for command tracking we must 
require 
IG|>|R|-1<1 W<w, (5.63) 


In summary, if we want “acceptable control” (|e| < 1) rather than “perfect 
control” (e = 0), then |G4g| in (5.60) should be replaced by |Ga| — 1, and 
similarly, R in (5.61) should be replaced by R— 1. The differences are clearly 
small at frequencies where |G4y| and |R| are much larger than 1. 

The requirements given by (5.62) and (5.63) are restrictions imposed on 
the plant design in order to avoid input constraints and they apply to any 
controller (feedback or feedforward control). If these bounds are violated at 
some frequency then performance will not be satisfactory (i.e, |e(w)| > 1) for 
a worst-case disturbance or reference occurring at this frequency. 


5.11.3. Unstable plant and input constraints 


Feedback control is required to stabilize an unstable plant. However, input 
constraints combined with large disturbances may make stabilization difficult. 
For example, for an unstable plant with a real RHP-pole at s = p we 
approximately need 


IG| > |Gal Yo <p (5.64) 
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Otherwise, the input will saturate when there is a sinusoidal disturbance 
d(t) = sinwt, and we may not be able to stabilize the plant. Note that the 
frequency p may be larger than the frequency wg at which |Gqg(jwa)| = 1. 


Proof of (§.64):. With feedback control the input signal is u = —KSGad = 
—G~'TGad. We showed in (5.48) that we need |T(jw)| > 1 up to the frequency 
p. Thus we need |u| > |G7'Ga|- |d| up to the frequency p, and to have |u| < 1 for 
|\d| = 1 (the worst-case disturbance) we must require |G~'Ga| < 1. 


Example 5.6 Consider 


5 ka 
(sehen C= GEpeIe+ 


G(s) = ka <1 (5.65 
Since ka <1 and the performance objective is |e| <1, we do not really need control 
for disturbance rejection, but feedback control is required for stabilization, since the 
plant has a RHP-pole al p = 1. We have |G| > |Ga| (i.e, |G7'Ga| < 1) for 
frequencies lower than 0.5/ka, so from (5.64) we do not expect problems with input 
constraints at low frequencies. However, at frequencies higher than 0.5/ka we have 
|G| < |Ga|, as is illustrated in Figure 5.15 (a). Thus from (5.64), we may expect to 
have problems with instability if 0.5/ka <p, i.e. if ka > 0.5. 


10 ( ) 
a ult 
\ 
IG 1 f - 
wai \ Unconstrained: — ~~ 
3 10 0.5 \ Constrained: 
£ ‘ 
& IGa| ere a 
a sal ain 
= 10 
7 p 
10S > 
10 10 5 10 
Frequency [rad/s] Time [sec] 


(a) Gand Gq with kg = 0.5. (b) Response to step in disturbance. 


Figure 5.15: Instability caused by input saturation for unstable plant 


To check this for a particular case we select kq =0.5 (the limiting value) and use 


the controller 5 
aoe 0.04 (10s + 1) 

s (0.1s + 1)? 
which without constraints yields a stable closed-loop system with a gain crossover 
frequency, we, of about 1.7. The closed-loop response to a unit step disturbance 
occurring after 1 second is shown in Figure 5.15 (b) both for the linear case when 
there are no input constraints (dashed line), and where u is constrained to be within 
the interval [—1 1] (solid lines). We see from the latter response that the system is 
indeed unstable when there is a disturbance that drives the system into saturation. 


(5.66) 
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Remark. Interestingly, for this example, a small reduction in the disturbance 
magnitude from ka = 0.5 to kq = 0.48 results in a stable closed-loop response in 
the presence of input constraints (not shown). Since ka = 0.5 is the limiting value 
obtained from (5.64), this seems to indicate that (5.64) is a very tight condition, but 
one should be careful about making such a conclusion. First, (5.64) was derived by 
considering sinusoids and the responses in this example are for a step disturbance. 
Furthermore, for other controllers the values of ka for which instability occurs will 
be different. 


For unstable plants, reference changes can also drive the system into input 
saturation and instability. But in contrast to disturbance changes, one then 
has the option to use a two degrees-of-freedom controller to filter the reference 
signal and thus reduce the magnitude of the manipulated input. 


5.12 Limitations imposed by phase lag 


We already know that phase lag from RHP-zeros and time delays is a 
fundamental problem, but are there any limitations imposed by the phase 
lag resulting from minimum-phase elements? The answer is both no and yes: 
No, there are no fundamental limitations, but Yes, there are often limitations 
on practical designs. 

As an example, consider a minimum-phase plant of the form 


k k 


CWS (+n) +78) +738) [fd +78) (67) 


where n is three or larger. At high frequencies the gain drops sharply with 
frequency, |G(jw)| = (k/ [| m)w~”. From condition (5.60), it is therefore likely 
(at least if k is small) that we encounter problems with input saturation. 
Otherwise, the presence of high-order lags does not present any fundamental 
limitations. 

However, in practice a large phase lag at high frequencies, e.g. /G(jw) > 
—n - 90° for the plant in (5.67), poses a problem (independent of kK’) even 
when input saturation is not an issue. This is because for stability we need a 
positive phase margin, i.e. the phase of L = GK must be larger than —180° 
at the gain crossover frequency w, That is, for stability we need w, < w1g0; 
see (2.28). 

In principle, wig9 (the frequency at which the phase lag around the loop is 
—180°) is not directly related to phase lag in the plant, but in most practical 
cases there is a close relationship. Define w,, as the frequency where the phase 
lag in the plant G is —180°, i.e. 


LG(jwy) = —180° 
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Note that w, depends only on the plant model. Then, with a proportional 
controller we have that w1g9 = w,, and with a PL-controller wigq < u,,. Thus 
with these simple controllers a phase lag in the plant does pose a fundamental 
limitation: 


Stability bound for P- or PI-control: | we < wy (5.68) 


Note that this is a strict bound to get stability, and for performance (phase 
and gain margin) we typically need w- less than bout 0.5wy. 

If we want to extend the gain crossover frequency w, beyond w,,, we must 
place zeros in the controller (e.g. “derivative action”) to provide phase lead 
which counteracts the negative phase in the plant. A commonly used controller 
is the PID controller which has a maximum phase lead of 90° at high frequency. 
In practice, the maximum phase lead is smaller than 90°. For example, an 
industrial cascade PID controller typically has derivative action over only one 


decade, 
TIst+1 tTps+1 
K(s)=K, 5.69 
(8) 718 Odtps+1 ee) 


and the maximum phase lead is 55° (which is the maximum phase lead of the 

term eth). This is also a reasonable value for the phase margin, so for 
-Atps+1 

performance we approximately require 


Practical performance bound (PID control): we < wx (5.70) 


We stress again that plant phase lag does not pose a fundamental limitation if 
a more complex controller is used. Specifically, if the model is known exactly 
and there are no RHP-zeros or time delays, then one may in theory extend w, 
to infinite frequency. For example, one may simply invert the plant model by 
placing zeros in the controller at the plant poles, and then let the controller 
roll off at high frequencies beyond the dynamics of the plant. However, in 
many practical cases the bound in (5.70) applies because we may want to use 
a simple controller, and also because uncertainty about the plant model often 
makes it difficult to place controller zeros which counteract the plant poles at 
high frequency. 


Remark. Relative order. The relative order (relative degree) of the plant is 
sometimes used as a input-output controllability measure (e.g. Daoutidis and 
Kravaris (1992). The relative order may be defined also for nonlinear plants, and it 
corresponds for linear plants to the pole excess of G(s). For a minimum-phase plant 
the phase lag at infinite frequency is the relative order times —90°. Of course, we 
want the inputs to directly affect the outputs, so we want the relative order to be 
small. However, the practical usefulness of the relative order is rather limited since 
it only gives information at infinite frequency. The phase lag of G(s) as a function 
of frequency, including the value of w~, provides much more information. 
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5.13 Limitations imposed by uncertainty 


The presence of uncertainty requires us to use feedback control rather than 
just feedforward control. The main objective of this section is to gain more 
insight into this statement. A further discussion is given in Section 6.10, where 
we consider MIMO systems. 


5.13.1 Feedforward control 


Consider a plant with the nominal model y = Gu + Gad. Assume that G(s) 
is minimum phase and stable and assume there are no problems with input 
saturation. Then perfect control, , e = y —r = 0, is obtained using a perfect 
feedforward controller which generates the following control inputs 


u=G 'r—-G'Gad (5.71) 
Now consider applying this perfect controller to the actual plant with model 
y =Gut+Gid (5.72) 


Substituting (5.71) into (5.72), we find that the actual control error with the 
“perfect” feedforward controller is 


G! G'/G! 

Sat = 1 £_1) Gd 5.73 
a ag ee ES 
— —— ee” 
rel. error in G rel. error in G/Gq 


Thus, we find for feedforward control that the model error propagates directly 
to the control error. From (5.73) we see that to achieve |e’| < 1 for |d| = 1 
we must require that the relative model error in G/Gq is less than 1/|G")]. 
This requirement is clearly very difficult to satisfy at frequencies where |G",| 
is much larger than 1, and this motivates the need for feedback control. 


5.13.2 Feedback control 


With feedback control the closed-loop response with no model error is y—r = 
S(Ga—r) where S = (I+GK)~' is the sensitivity function. With model error 
we get 

y —r=S'(Gid—r) (5.74) 


where S’ = (I+G'K)~! can be written (see (A.138)) as 


1 
— ST Er 


Uy 


(5.75) 
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Here E = (G’ — G)/G is the relative error for G, and T is the complementary 
sensitivity function. 

From (5.74) we see that the control error is only weakly affected by model 
error at frequencies where feedback is effective (where |S| << 1 and T = 1). 
For example, if we have integral action in the feedback loop and if the feedback 
system with model error is stable, then S(0) = S’(0) = 0 and the steady-state 
control error is zero even with model error. 

Uncertainty at crossover. However, although feedback control counter- 
acts the effect of uncertainty at frequencies where the loop gain is large, un- 
certainty in the crossover frequency region can result in poor performance 
and even instability. This may be analyzed by considering the effect of the 
uncertainty on the gain margin, GM = 1/|L(jwiso)|, where wigo is the fre- 
quency where /D is —180°, see (2.34). Most practical controllers behave as 
a constant gain K, in the crossover region, so |L(jwiso| approrK 5|G(jwrso| 
where Ww g9 © w,, (since the phase lag of the controller is approximately zero 
at this frequency, see also Section 5.12). Here w,, is the frequency where 
LG(jwy,) = —180°. This observation yields the following approximate rule: 


e Uncertainty which keeps |G(jwy)| approximately constant will not change 
the gain margin. Uncertainty which increases |G(jw,)| may yield instability. 


This rule is useful, for example, when evaluating the effect of parametric 
uncertainty. This is illustrated in the following example. 


Example 5.7 Consider a stable first-order delay process, G(s) = ke~°*/(1 +s), 
where the parameters k, 7 and 0 are uncertain in the sense that they may vary with 
operating conditions. If we assume tT > @ then wy & (7/2)/7 and we derive 


2 
|G(jwu)| & =k (5.76) 


We see that to keep |G(jwy)| constant we want k& constant. From (5.76) we see, 
for example, that an increase in 0 increases |G(jwu)|, and may yield instability. 
However, the uncertainty in the parameters is often coupled. For example, the ratio 
7/0 may be approximately constant, in which case an increase in 0 may not affect 
stability, In another case the steady-state gain k may change with operating point, 
but this may not affect stability if the ratio k/t, which determines the high-frequency 
gain, is unchanged. 


The above example illustrates the importance of taking into account the 
structure of the uncertainty, for example, the coupling between the uncertain 
parameters. A robustness analysis which assumes the uncertain parameters 
to be uncorrelated is generally conservative. This is further discussed in 
Chapters 7 and 8. 
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5.14 Summary: Controllability analysis with 
feedback control 


Figure 5.16: Feedback control system 


We will now summarize the results of this chapter by a set of “controllability 
tules”. We use the term “(input-output) controllability” since the bounds 
depend on the plant only, that is, are independent of the specific controller. 
Except for Rule 7, all requirements are fundamental, although some of the 
expressions, as seen from the derivations, are approximate (i.e, they may be 
off by a factor of 2 or thereabouts). However, for practical designs the bounds 
will need to be satisfied to get acceptable performance. 

Consider the control system in Figure 5.16, for the case when all blocks are 
scalar. The model is 


y= G(s)u+ Ga(s)d; Ym = Gm(s)y (5.77) 


Here G',(s) denotes the measurement transfer function and we assume 
G,(0) = 1 (perfect steady-state measurement). The variables d, u, y and r 
are assumed to have been scaled as outlined in Section 1.4, and therefore G(s) 
and Ga(s) are the scaled transfer functions. Let w, denote the gain crossover 
frequency; defined as the frequency where |L(jw)| crosses 1 from above. Let 
wa denote the frequency at which |Ga(jwa)| first crosses 1 from above. The 
following rules apply: 


Rule 1 Speed of response to reject disturbances. We approzimately 
require We > wg. More specifically, with feedback control we require 
|S(jw)| < |1/Ga(jw)| Vw. (See (5.54) and (5.57).) 


Rule 2 Speed of response to track reference changes. We require 
|S(jw)| < 1/R up to the frequency w, where tracking is required. (See 
(5.58).) 
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Rule 3 Input constraints arising from disturbances. For acceptable 
control (jel < 1) we require |G(jw)| > |Ga(jw)| — 1 at frequencies 
where |Ga(jw)| > 1. For perfect control (e = 0) the requirement is 
IG(jw)| > |Ga(jw)|. (See (5.60) and (5.62).) 


Rule 4 Input constraints arising from setpoints. We require |G(jw)| > 
R-—1 up to the frequency w, where tracking is required. (See (5.63).) 


Rule 5 Time delay 9 in G(s)G,,(s). We approximately require w. < 1/6. 
(See (5.28).) 


Rule 6 Tight control at low frequencies with a RHP-zero z in 
G(s)Gm(s). For a real RHP-zero we require we < z/2 and for an 
imaginary RHP-zero we approximately require w. < |z|. (See (5.30) and 
(5.32).) 


Remark. Strictly speaking, a RHP-zero only makes it impossible to have tight 
control in the frequency range close to the location of the RHP-zero. If we 
do not need tight control at low frequencies, then we may reverse the sign of 
the controller gain, and instead achieve tight control at higher frequencies. In 
this case we must for a RHP-zeros approximately require w, > 2z. A special 
case is for plants with a zero at the origin; here we can achieve good transient 
control even though the control has no effect at steady-state. 


Rule 7 Phase lag constraint. We require in most practical cases (e.g., 
with PID control): we < Wy. Here the ultimate frequency uw, is where 
LGGm(jwu) = —180°. (See (5.70).) 


Since time delays (Rule 5) and RHP-zeros (Rule 6) also contribute to 
the phase lag, one may in in most practical cases combine Rules 5, 6 
and 7 into the single rule: we < wy, (Rule 7). 


Rule 8 Real open-loop unstable pole in G(s) at s = p. We need 
high feedback gains to stabilize the system and we approximately require 
we > 2p. (See (5.49).) 


In addition, for unstable plants we need |G'| > |Ga| up to the frequency 
p (which may be larger than wa where |G'a| = 1|). Otherwise, the input 
may saturate when there are disturbances, and the plant cannot be 
stabilized. (See (5.64).) 


Most of the rules are illustrated graphically in Figure 5.17. 

We have not formulated a rule to guard against model uncertainty. The 
reason is that, as shown in (5.74) and (5.75), uncertainty has only a minor 
effect on feedback performance for SISO systems, except at frequencies where 
the relative uncertainty EF approaches 100%, and we obviously have to detune 
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Margins for stability and performance. 
M:: Margin to stay within constraints, |u| < 1. 
M2: Margin for performance, |e| < 1. 
Mz: Margin because of RHP-pole, p. 
Ma : Margin because of RHP-zero, z. 
Ms: Margin because of frequency where plant has —180° phase lag, wx. 
Me: Margin because of delay, 0. 


Figure 5.17: Illustration of controllability requirements 
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the system. Also, since 100% uncertainty at a given frequency allows for the 
presence of a RHP-zero on the imaginary axis at this frequency (G(jw) = 0), 
it is already covered by Rule 6. 

The rules are necessary conditions (“minimum requirements”) to achieve 
acceptable control performance. They are not sufficient since among other 
things we have only considered one effect at a time. 

The rules quantify the qualitative rules given in the introduction. For 
example, the rule “Control outputs that are not self-regulating” may be 
quantified as: “Control outputs y for which |Ga(jw)| > 1 at some frequency” 
(Rule 1). The rule “Select inputs that have a large effect on the outputs” may 
be quantified as: “In terms of scaled variables we must have |G| > |Gq| — 1 
at frequencies where |G'g| > 1 (Rule 3), and we must have |G| > R— 1 at 
frequencies where setpoint tracking is desired (Rule 4)”. Another important 
insight from the above rules is that a larger disturbance or a smaller 
specification on the control error requires faster response (higher bandwidth). 

In summary, Rules 1, 2 and 8 tell us that we need high feedback gain (“fast 
control”) in order to reject disturbances, to track setpoints and to stabilize 
the plant. On the other hand, Rules 5, 6 and 7 tell us that we must use low 
feedback gains in the frequency range where there are RHP-zeros or delays or 
where the plant has a lot of phase lag. We have formulated these requirements 
for high and low gain as bandwidth requirements. If they somehow are in 
conflict then the plant is not controllable and the only remedy is to introduce 
design modifications to the plant. 

Sometimes the problem is that the disturbances are so large that we hit 
input saturation, or the required bandwidth is not achievable. To avoid the 
latter problem, we must at least require that the effect of the disturbance is 
less than 1 (in terms of scaled variables) at frequencies beyond the bandwidth, 
(Rule 1) 


IGa(jw)| <1; Ww > we (5.78) 
where as found above we approximately require w, < 1/6 (Rule 5), w. < 2/2 


(Rule 6) and w. < w, (Rule 7). Condition (5.78) may be used, as in the 
example of Section 5.16.3 below, to determine the size of equipment. 


5.15 Controllability analysis with feedforward 
control 


The above controllability rules apply to feedback control, but we find that 
essentially the same conclusions apply to feedforward control where relevant. 
That is, if a plant is not controllable using feedback control, it is usually not 
controllable with feedforward control. A major difference, as shown below, is 


LIMITATIONS ON PERFORMANCE IN SISO SYSTEMS 213 


that a delay in Ga(s) is an advantage for feedforward control (“it gives the 
feedforward controller more time to make the right action”). Also, a RHP-zero 
in Gq(s) is also an advantage for feedforward control if G(s) has a RHP-zero 
at the same location. Rules 3 and 4 on input constraints apply directly to 
feedforward control, but Rule 8 does not apply since unstable plants can only 
be stabilized by feedback control. The remaining rules in terms of performance 
and “bandwidth” do not apply directly to feedforward control. 

Controllability can be analyzed by considering the feasibility of achieving 
of perfect control. The feedforward controller is 


u = Ka(s)dm 


where dm = Gma(s)d is the measured disturbance. The disturbance response 
with r = 0 becomes 


e= Gu+ Gad = (GKaGma + Ga)d (5.79) 


(Reference tracking can be analyzed similarly by setting Gng = 1 and 
Ga=—R.) 
Perfect control. e = 0 is from (5.79) achieved with the 


Kut — _GGuGhy (5.80) 


This assumes that perc is stable and causal (no prediction), and so 
G7'GGina should have no RHP-zeros and no (positive) delay. From this we 
find that a delay (or RHP-zero) in Ga(s) is an advantage if it cancels a delay 
(or RHP-zero) in GG'ma. 

Ideal control. If perfect control is not possible, then one may analyze 
controllability by considering an “ideal” feedforward controller, Kide!, which 
is (5.80) modified to be stable and causal (no prediction). The controller 
is ideal in that it assumes we have a perfect model. Controllability is then 
analyzed by using Kide! in (5.79). An example is given below in (5.89) and 
(5.90) for a first-order delay process. 

Model uncertainty. As discussed in Section 5.13, model uncertainty is a 
more serious problem for feedforward than for feedback control because there 
is no correction from the output measurement. For disturbance rejection, we 
have from (5.73) that the plant is not controllable with feedforward control if 
the relative model error for G/Gq at any frequency exceeds 1/|G4|. Here Gg 
is the scaled disturbance model. For example, if |Ga(jw)| = 10 then the error 
in G/Ga must not exceed 10% at this frequency. In practice, this means that 
feedforward control has to be combined with feedback control if the output 
is sensitive to the disturbance ()i.e. if |Ga| is much larger than 1 at some 
frequency). 

Combined feedback and feedforward control. To analyze controllabil- 
ity in this case we may assume that the feedforward controller Kg has already 
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been designed. Then from (5.79) the controllability of the remaining feedback 
problem can be analyzed using the rules in Section 5.14 if Ga(s) is replaced 
by 

Gals) = GKaGma + Ga (5.81) 


However, one must beware that the feedforward control may be very sensitive 
to model error, so the benefits of feedforward may be less in practice. 

Conclusion. From (5.81) we see that the primary potential benefit of 
feedforward control is to reduce the effect of the disturbance and make Gu 
less than 1 at frequencies where feedback control is not effective due to, for 
example, a delay or a large phase lag in GGm/(s). 


5.16 Applications of controllability analysis 


5.16.1 First-order delay process 


Problem statement. Consider disturbance rejection for the following 
process 
e798 e 9a8 


Ga(s) = 


=k 
"T4748 


G(s) 


= ke ra (5.82) 
In addition there are measurement delays 6,, for the output and 0,,q for 
the disturbance. All parameters have been appropriately scaled such that at 
each frequency |u| < 1,|d| < 1 and we want |e| < 1. Assume |kg| > 1. Treat 
separately the two cases of i) feedback control only, and ii) feedforward control 
only, and answer the following: 

a) For each of the eight parameters in this model explain qualitatively what 
value you would choose from a controllability point of view (with answers such 
as large, small, value has no effect). 

b) Give quantitative relationships between the parameters which should be 
satisfied to achieve controllability. Assume that appropriate scaling has been 
applied in such a way that the disturbance is less than 1 in magnitude, and 
that the input and the output are required to be less than 1 in magnitude. 

Solution. a) Qualitative. We want the input to have a “large, direct and fast 
effect” on the output, while we want the disturbance to have a “small, indirect 
and slow effect”. By “direct” we mean without any delay or inverse response. 
This leads to the following conclusion. For both feedback and feedforward 
control we want k and rq large, and 7, # and kg small. For feedforward control 
we also want 04 large (we then have more time to react), but for feedback the 
value of 64, does not matter; it translates time, but otherwise has no effect. 
Clearly, we want 6,, small for feedback control (it is not used for feedforward), 
and we want 6mqa small for feedforward control (it is not used for feedback). 
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b) Quantitative. To stay within the input constraints (|u| < 1) we must 
from Rule 4 require |G(jw)| > |Ga(jw)| for frequencies w < wy. Specifically, 
for both feedback and feedforward control 


k > ka; k/t > ka/Ta (5.83) 


Now consider performance where the results for feedback and feedforward 
control differ. i) First consider feedback control. From Rule 1 we need for 
acceptable performance (|e| < 1) with disturbances 


Wq & ka/Ta <We (5.84) 
On the other hand, from Rule 5 we require for stability and performance 
We < 1/6108 (5.85) 


where 6:54 = 9+ Om is the total delay around the loop. The combination of 
(5.84) and (5.85) yields the following requirement for controllability 


Feedback: 0+0m < Ta/ka (5.86) 


ii) For feedforward control, any delay for the disturbance itself yields a 
smaller “net delay”, and to have |e| < 1 we need “only” require 


Feedforward: 6+ 6ma—9a < Ta/ka (5.87) 


Proof of (5.87): Introduce 0-0 +0ma—9a, and consider first the case with Q <0 (so 
(5.87) is clearly satisfied). In this case perfect control is possible using the controller 
(5.80), 


ka 14+ 78 
e 


Kp — 6 'GaGs, = hk 1+7as 


(5.88) 


so we can even achieve e = 0. Next, consider @ > 0. Perfect control is not possible, 


so instead we use the “ideal” controller obtained by deleting the prediction ev. 


_ ka 1+7s 


K ideal = Nd : 
# k 1+ 748 ep!) 
From (5.79) the response with this controller is 
© = (GK G4 + Gad = MO — Fg (5.90 
= d md Be ara ae . 


and to achieve |e|/|d| < 1 we must require hag < 1 (using asymptotic values and 


1—e-* x @ for small x) which is equivalent to (5.87). 
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5.16.2 Application: Room heating 


Consider the problem of maintaining a room at constant temperature, as 
discussed in Section 1.5. Let y be the room temperature, u the heat input 
and d the outdoor temperature. Feedback control should be used. Let the 
measurement delay for temperature (y) be Om = 100s. 


1. Is the plant controllable with respect to disturbances? 

2. Is the plant controllable with respect to setpoint changes of magnitude 
R = 3 (+3 K) when the desired response time for setpoint changes is 
Tr = 1000 s (17 min) ? 


— 


Solution. A critical part of controllability analysis is scaling. A model in 
terms of scaled variables was derived in (1.25) 


20 10 


Cs) = Those? SH) = ipo0shT 


(5.91) 


The frequency responses of |G| and |Gu| are shown in Figure 5.18. 
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Figure 5.18: Frequency responses for room heating example 


1. Disturbances. From Rule 1 feedback control is necessary up to the 
frequency wq = 10/1000 = 0.01 rad/s, where |G4g| crosses 1 in magnitude 
(w. > wa). This is exactly the same frequency as the upper bound given by 
the delay, 1/0 = 0.01 rad/s (w, < 1/0). We therefore conclude that the system 
is barely controllable for this disturbance. From Rule 3 no problems with input 
constraints are expected since |G| > |Ga| at all frequencies. These conclusions 
are supported by the closed-loop simulation in Figure 5.19 (a) for a unit 
step disturbance (corresponding to a sudden 10 K increase in the outdoor 
temperature) using a PID-controller of the form in (5.69) with kK. = 0.4 
(scaled variables), t7- = 200 s and tp = 60 s. The output error exceeds its 
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Figure 5.19: PID feedback control of room heating example 


allowed value of 1 for a very short time after about 100 s, but then returns 
quite quickly to zero. The input goes down to about -0.8 and thus remains 
within its allowed bound of +1. 

2. Setpoints. The plant is controllable with respect to the desired setpoint 
changes. First, the delay is 100 s which is much smaller than the desired 
response time of 1000 s, and thus poses no problem. Second, |G(jw)| > R=3 
up to about w, = 0.007 [rad/s] which is seven times higher than the required 
w, = 1/7, = 0.001 [rad/s]. This means that input constraints pose no problem. 
In fact, we should be able to achieve response times of about 1/w1 = 150 s 
without reaching the input constraints. This is confirmed by the simulation 
in Figure 5.19 (b) for a desired setpoint change 3/(150s + 1) using the same 
PID controller as above. 


5.16.3. Application: Neutralization process 


The following application is interesting in that it shows how the controllability 
analysis tools may assist the engineer in redesigning the process to make it 
controllable. 

One tank. Consider the process in Figure 5.20, where a strong acid with 
pH= —1 (yes, a negative pH is possible — it corresponds to c7+ = 10 mol/l) is 
neutralized by a strong base (pH=15) in a mixing tank with volume V= 10m. 
The problem is to use feedback control to keep the pH in the product stream 
(output y) in the range 7 +1 (“salt water”) by manipulating the amount of 
base, gp (input u) in spite of variations in the flow of acid, q4 (disturbance 
d). The delay in the pH-measurement is 6m = 10s. 

To achieve the desired product with pH=7 one must exactly balance the 
inflow of acid (the disturbance) by addition of base (the manipulated input). 
Intuitively, one might expect that the main control problem is to adjust the 
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Figure 5.20: Neutralization process with one mixing tank 


base accurately by means of a very accurate valve. However, as we will see 
this “feedforward” way of thinking is misleading, and the main hurdle to good 
control is the need for very fast response times. 

We the controlled output to be the excess of acid, c [mol/l], defined as 
Cc = Cy+ — Coy-- In terms of this variable the control objective is to keep 
lc| < Cmax = 10~® mol/I, and the plant is a simple mixing process modelled 
by 
d 
a“? = qaca + qBcB — qc (5.92) 
The nominal values for the acid and base flows are q*, = qj = 0.005 [ m?/s] 
resulting in a product flow g* = 0.01 [m?/s]= 10 [l/s]. Here superscript * 
denotes the steady-state value. Divide each variable by its maximum deviation 
to get the following scaled variables 


= : =~; d= 5.93 
¥* 30-6 Ts 0.59% es) 
Then appropriately scaled linear model for one tank becomes 
ka —2ka F 
G = ; G(s)= ; ka=2.5-10 5.94 
a(s) 14th’ (s) l+ms’ @ ( ) 


where tT, = V/q = 1000 s is the residence time for the liquid in the tank. Note 
that the steady-state gain in terms of scaled variables is more than a million 
so the output is extremely sensitive to both the input and the disturbance. 
The reason for this high gain is the much higher concentration in the two feed 
streams, compared to that desired in the product stream. 
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Figure 5.21: Frequency responses for the neutralization process with one mixing 
tank 


We now proceed with the controllability analysis. The frequency responses 
of Ga(s) and G(s) are shown graphically in Figure 5.21. From Rule 2, input 
constraints do not pose a problem since |G| = 2|Gq| at all frequencies. The 
main control problem is the high disturbance sensitivity, and from (5.84) (Rule 
1) we find the frequency up to which feedback is needed 


wa & ka/t = 2500 rad/s (5.95) 


This requires a response time of 1/2500 = 0.4 millisecond which is clearly 
impossible in a process control application, and is in any case much less than 
the measurement delay of 10 s. 


ACID 


Figure 5.22: Neutralization process with two tanks and one controller 


Design change: Multiple tanks. The only way to improve controllability 
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is to modify the process. This is done in practice by performing the 
neutralization in several steps as illustrated in Figure 5.22 for the case of 
two tanks. This is similar to playing golf where it is often necessary to use 
several strokes to get to the hole. With nm equal mixing tanks in series the 
transfer function for the effect of the disturbance becomes 


1 


FrESTD (5.96) 


Ga(s) = kahn(s);  hn(s) = 


where kg = 2.5- 10° is the gain for the mixing process, h»(s) is the transfer 
function of the mixing tanks, and 7), is the total residence time, Vio1/q. The 
magnitude of h,,(s) as a function of frequency is shown in Figure 5.23 for one 
to four equal tanks in series. 


10 


Magnitude 


is 
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Figure 5.23: Frequency responses for n tanks in series with the same total residence 
time tp; hn(s) =1/(4s+1)", n=1,2,3,4 


From controllability Rules 1 and 5 we must at least require for acceptable 
disturbance rejection that 


|Ga(jwe)] <1] we 21/0 (5.97) 


where @ is the delay in the feedback loop. Thus, one purpose of the mixing 
tanks hn(s) is to reduce the effect of the disturbance by a factor ka(= 2.5-10°) 
at the frequency wa (= 0.1 [rad/s]), i-e. |An(Jwo)| < 1/ka. With 7, = Viot/q we 
obtain the following minimum value for the total volume for n equal tanks in 
series 


Viot = qOny/ (ka)?/” — 1 (5.98) 


where g = 0.01 m?/s. With 6 = 10 s we then find that the following designs 
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have the same controllability with respect to disturbance rejection: 


No. of Total Volume 
tanks volume each tank 


nm Veor [m*] __ [m*] 
1 250000 250000 
2 316 158 
3 40.7 13.6 
4 15.9 3.98 
5 9.51 1.90 
6 6.96 1.16 
ig 5.70 0.81 


With one tank we need a volume corresponding to that of a supertanker to 
get acceptable controllability. The minimum total volume is obtained with 18 
tanks of about 203 liters each — giving a total volume of 3.662 m°. However, 
taking into account the additional cost for extra equipment such as piping, 
mixing, measurements and control, we would probably select a design with 3 
or 4 tanks for this example. 


Control system design 


The condition |Ga(jwe)| < 1 in (5.97), which formed the basis for redesigning 
the process, may be optimistic because it only ensures that we have |S| < 
1/|Ga| at the crossover frequency wp & we & we. However, from Rule 1 we 
also require that |S| < 1/|Ga|, or approximately |L£| > |Gal, at frequencies 
lower than w,, and this may be difficult to achieve since Ga(s) = kah(s) 
is of high order. The problem is that this requires |L| to drop steeply with 
frequency, which results in a large negative phase for L, whereas for stability 
and performance the slope of |L| at crossover should not be greater than —1, 
approximately (see Section 2.6.2). 

Thus, the control system in Figure 5.22 with a single feedback controller 
will not achieve the desired performance. The solution is to install a local 
feedback control system on each tank and to add base in each tank as shown in 
Figure 5.24. This is another plant design change since it requires an additional 
measurement and actuator for each tank. Consider the case of n tanks in series. 
With n controllers the overall closed-loop response from a disturbance into 
the first tank to the pH in the last tank becomes 


Ie 


1 G ” 
y= Gal] (75548 7 Le][ Li (5.99) 
i=1 a : 


where Gg = Tis G; and L; = G;K;, and the approximation applies at low 
frequencies where feedback is effective. 
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Figure 5.24: Neutralization process with two tanks and two controllers. 


In this case, we can design each loop L;(s) with a slope of —1 and bandwidth 
We & We, such that the overall loop transfer function L has slope —n and 
achieves |£| > |Ga| at all frequencies lower than wy (the size of the tanks 
are selected as before such that wa & wy). Thus, our analysis confirms the 
usual recommendation of adding base gradually and having one pH-controller 
for each tank (McMillan, 1984, p. 208). It is unlikely that any other control 
strategy can achieve a sufficiently high roll-off for |Z]. 

In summary, this application has shown how a simple controllability analysis 
may be used to make decisions on both the appropriate size of the equipment, 
and the selection of actuators and measurements for control. Our conclusions 
are in agreement with what is used in industry. Importantly, we arrived at 
these conclusions, without having to design any controllers or perform any 
simulations. Of course, as a final test, the conclusions from the controllability 
analysis should be verified by simulations using a nonlinear model. 


Exercise 5.8 Comparison of local feedback and cascade control. Explain 
why a cascade control system with two measurements (pH in each tank) and only 
one manipulated input (the base flow into the first tank) will not achieve as good 
performance as the control system in Figure 5.24 where we use local feedback with 
two manipulated inputs (one for each tank). (Hint: Show first that the closed-loop 
response for the cascade control system is as in (5.99) but with 


11 =Giki, I2=GiG2Ki Ks, ..., Li=]]G)K; 
j=l 


rather than Li = GiK;.) 
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The following exercise considers the use of buffer tanks for reducing quality 
(concentration, temperature) disturbances in chemical processes. 


Exercise 5.9 (a) The effect of a concentration disturbance must be reduced by a 
factor of 100 at the frequency 0.5 rad/min. The disturbances should be dampened 
by use of buffer tanks and the objective is to minimize the total volume. How many 
tanks in series should one have? What is the total residence time? 

(b) The feed to a distillation column has large variations in concentration and the 
use of one buffer tank is suggest to dampen these. The effect of the feed concentration 
d on the product composition y is given by (scaled variables, time in minutes) 


Ga(s) =e */3s 


(that is, after a step ind the output y will, after an initial delay of 1 min, increase in 
a ramplike fashion and reach its maximum allowed value (which is 1) after another 
3 minutes). Feedback control should be used and there is a additional measurement 
delay of 5 min. What should the residence time in the tank be? 

(c) Show that in terms of minimizing the total volume for buffer tanks in series, 
it is optimal to have buffer tanks of equal size. 

(d) Is there any reason to have buffer tanks in parallel (they must not be of equal 
size because then one may simply combine them)? 

(e) What about parallel pipes in series (pure delay). Is this a good idea? 


Buffer tanks are also used in chemical processes to dampen liquid flowrate 
disturbances (or gas pressure disturbances). This is the topic of the following 
exercise. 


Exercise 5.10 Let d = qin [m?/s] denote a flowrate which acts a disturbance to 
the process. We add a buffer tank (with liquid volume V [m?]), and use a “slow” level 
controller K such that the outflow do = qout (the “new” disturbance) is smoother 
then the inflow qin (the “original” disturbance). The idea is to temporarily increase 
or decrease the liquid volume in the tank to avoid sudden changes in qout. Note that 
the steady-state value of qout must equal that of qin. 

A material balance yields V(s) = (din(S) — dout/8) and with a level controller 
dout(s) = K(s)V(s)i we find that 


d,(s) (5.100) 


The design of a buffer tank for a flowrate disturbance then consists of two steps 


1. Design the level controller K(s) such has h(s) has the desired shape (e.g., 
determined by a controllability analysis of how dz affects the remaining process; 
note that we always must have h(0) = 1). 

2. Design the size of the tank (determine its volume Vmax) such that the tanks does 
not overflow or go empty for the expected disturbances in qin. 
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Assume the inflow varies in the range qi, + 100% where qj, is its nominal value, 
and apply this stepwise procedure to two cases. 

(a) The desired transfer function is h(s) = 1/(7s +1). 

(b) The desired transfer function is h(s) =1/(ms +1)”. 

(c) Explain why it is usually not recommended to have integral action in K(s). 

(d) In case (b) one could alternatively use two tanks in series with controllers 
designed as in (a). Explain why this is most likely not a good solution. (Solution: 
The required total volume is the same, but the cost of two small tanks is larger than 
one large tank). 


5.17 Additional Exercises 


Exercise 5.11 What information about a plant is important for controller design, 
or more specifically, in which frequency range is it important to know the model well? 
To answer this problem you may think about the following sub-problems: 

(a) Explain what information about the plant is used for Ziegler-Nichols tuning of 
a SISO PID-controller. 

(b) Is the steady-state plant gain G(0) important for controller design? (As an 
example consider the plant G(s) = oa with ja| < 1 and design a P-controller 
K(s) = Ke such that we = 100. How does the controller design and the closed-loop 


response depend on the steady-state gain G(0) = 1/a?) 


Exercise 5.12 Let H(s) = Kie~*'*, G(s) = Kye! aera and Ga(s) = 
G(s) H(s). The measurement device for the output has transfer function Gm(s) = 
e825 The unit for time is seconds. The nominal parameter values are: K, = 0.24, 
0, =1 [s], Kz = 38, 02 = 5 [s], and T = 2 [s]. 

(a) Assume all variables have been appropriately scaled. Is the plant input-output 
controllable? 

(b) What is the effect on controllability of changing one model parameter at a time 
in the following ways: 
1. 01 is reduced to 0.1 [s]. 
2. 02 is reduced to 2 [s]. 
3. Ky is reduced to 0.024. 
4. Ke is reduced to 8. 
5. T is increased to 30 [s]. 


Exercise 5.13 A heat exchanger is used to exchange heat between two streams; a 
coolant with flowrate gq (1+1 kg/s) is used to cool a hot stream with inlet temperature 
To (100 + 10°C) to the outlet temperature T (which should be 60 + 10°C). The 
measurement delay for T is 3s. The main disturbance is on To. The following model 
in terms of deviation variables is derived from heat balances 


= 8 __0.6(208 +1) 
T(8) = 0541) (ds +1) 0) + GOs +1128 +1) 


where T and Tp are in °C, q is in kg/s, and the unit for time is seconds. Derive the 
scaled model. Is the plant controllable with feedback control? (Solution: The delay 


To(s) (5.101) 
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poses no problem (performance), but the effect of the disturbance is a bit too large 
at high frequencies (input saturation), so the plant is not controllable). 


Exercise 5.14 The model of an industrial robot arm is as follows 


250(as* + 0.00015 + 100) 
s(as? + 0.0001(500a + 1)s + 100(500a + 1)) 


G(s) = 


where a € [0.0002, 0.002]. Sketch the Bode plot for the two extreme values of a. What 
kind of control performance do you expect? Discuss how you may best represent this 
uncertainty (see Chapter 8). 


5.18 Conclusion 


The chapter has presented a frequency domain controllability analysis for 
scalar systems applicable to both feedback and feedforward control. We 
derived on page 209 a set of controllability rules which are necessary conditions 
(“minimum requirements”) to achieve acceptable control performance. They 
are not sufficient since among other things they only consider one effect at 
a time. The rules may be used to determine whether or not a given plant is 
controllable. indicators”. The method has been applied to a pH neutralization 
process, and it is found that the heuristic design rules given in the literature 
follow directly. The key steps in the analysis are to consider disturbances and 
to scale the variables properly. 

The tools presented in this chapter may also be used to study the 
effectiveness of adding extra manipulated inputs or extra measurements 
(cascade control). They may also be generalized to multivariable plants 
where directionality becomes a further crucial consideration. Interestingly, a 
direct generalization to decentralized control of multivariable plants is rather 
straightforward and involves CLDG and PRGA; see page 472 in Chapter 10. 
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LIMITATIONS ON 
PERFORMANCE IN MIMO 
SYSTEMS 


In this chapter we generalize results of Chapter 5 to MIMO systems. We first discuss 
fundamental limitations on the sensitivity and complementary sensitivity functions 
imposed by the presence of RHP-zeros. We then consider separately the issues of 
functional controllability, RHP-zeros, RHP-poles, disturbances, input constraints 
and uncertainty. Finally, we summarize the main steps in a procedure for analyzing 
the input-output controllability of MIMO plants. 


6.1 Introduction 


Ina MIMO system, disturbances, the plant, RHP-zeros, RHP-poles and delays 
each have directions associated with them. This makes it more difficult to 
consider their effects separately, as we did in the SISO case where we were 
able to reformulate the imposed limitations in terms of bounds on the loop 
gain, |L|, and its crossover frequency, w,. For example, a multivariable plant 
may have a RHP-zero and a RHP-pole at the same location, but their effects 
may not interact if they are in completely different parts of the system; recall 
(4.73). 

We will quantify the directionality of the various effects in G and Gq mainly 
by their output directions: 


yz: output direction of a RHP-zero, see (4.68) 

Yp: Output direction of a RHP-pole, see (4.69) 

ya: Output direction of a disturbance, see (6.30) 

uj: th output direction (singular vector) of the plant, see (3.32)! 


All these are / x 1 vectors where | is the number of outputs. y, and yp are 


A as Se eh OO es SERPS Rare let rk SR eee . . 
Note that u; here is the 7’th output singular vector, and not the i’th input. 
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fixed complex vectors, while ya(s) and u;(s) are frequency-dependent (s may 
here be viewed as a generalized complex frequency; in most cases s = jw). All 
output vectors are normalized such that they have Euclidean length 1, 


lyzll2=1, llypll2=1,  Ilya(s)ll2=1,  [Iua(s)ll2 = 1 


We may also consider the associated input directions of G. However, these 
directions are usually of less interest since we are primarily concerned with 
the performance at the output of the plant. 

The angles between the various output directions can be quantified using 
their inner products: |y/ yp|, ly yal, etc. The inner product gives a number 
between 0 and 1, and from this we can define the angle in the first quadrant, 
see (A.111). For example, the output angle between a pole and a zero is 


$ = arccos |y2"yp| 


to characterize directions: 

We assume throughout this chapter that the models have been scaled as 
outlined in Section 1.4. The scaling procedure is the same as that for SISO 
systems, except that the scaling factors D,, Da, D; and D, are diagonal 
matrices with elements equal to the maximum change in each variable uj, di, 
r; and e;. The control error in terms of scaled variables is then 


e=y—r=Gu+Gad— RF 


where at each frequency we have ||u(w)||max < 1, ||d()|lmax < 1, and 
\|"(w) || max < 1, and the control objective is to achieve |lel|max(w) < 1. Here 
l| - lmax is the vector infinity-norm, that is, the largest element in the vector. 
This norm is sometimes denoted || - ||.., but this is not used here to avoid 
confusing it with the H.. norm of the transfer function (where the oo denotes 
the maximum over frequency rather than the maximum over the elements 
of the vector). As for SISO, systems we see that reference changes may be 
analyzed as a special case of disturbances by replacing Gg by —R. 

Whether various disturbances and reference changes should be considered 
separately or simultaneously is a matter of design philosophy. In this chapter 
we mainly consider their effects separately, on the grounds that it is unlikely 
that for several disturbances to attain their worst values simultaneously. This 
leads to necessary conditions for acceptable performance, which involve the 
elements of different matrices rather than the matrix norms. 
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6.2 Constraints on S and T 

6.2.1 S plus T is the identity matrix 

From the identity S+7T =I and (A.48), we get 


|1—a(S)| < o(T) <14+4(S) (6.1) 


1 — a()| < a(S) <144(7) (6.2) 


This shows that we cannot have both S and T small simultaneously and that 
a(S) is large if and only if o(T) is large. 


6.2.2 Sensitivity integrals 


For SISO systems we presented several integral constraints on the sensitivity 
(the waterbed effects). These may be generalized to MIMO systems by using 
the determinant or singular value of S, see Boyd and Barratt (1991) and 
Freudenberg and Looze (1988). For example, the generalization of the Bode 
sensitivity integral in (5.6) may be written 


N, 


[ In | det S(jw)|dw = ie Ino; (S(jw))dw = 7 - s Re(p;) (6.3) 


t=1 


For a stable L(s) the integrals are zero. Other generalizations are also 
available, see Zhou et al. (1996). However, although these relationships are 
interesting, it seems difficult to derive from them concrete bounds on the 
achievable performance. 


6.2.3. Interpolation constraints 


RHP-zero. If G(s) has a RHP-zero at z with output direction y., then for 
internal stability of the feedback system the following interpolation constraints 
must apply: 

ye T(z) =0; y2'S(z) =y2 (6.4) 


In words, (6.4) says that T must have a RHP-zero in the same direction as G 
and that S(z) has an eigenvalue of 1 corresponding to the left eigenvector y-. 


Proof of (6.4): From (4.68) there exists an output direction yz such that y!’G(z) = 0. 
For internal stability, the controller cannot cancel the RHP-zero and it follows that 
L=GK has a RHP-zero in the same direction, i-e., y/ L(z) =0. Now S = (I+ L)~! 
is stable and thus has no RHP-pole at s = z. It then follows from T = LS that 
yi T(z) =0 and y! (I — S$) =0. 
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RHP-pole. If G(s) has a RHP pole at p with output direction yp, then for 
internal stability the following interpolation constraints apply 


S(p)Yp =0; T(p)Yp = Yp (6.5) 


Proof of (6.5): The square matrix L(p) has a RHP-pole at s = p, and if we assume 
that L(s) has no RHP-zeros at s = p then L~'(p) exists and from (4.70) there exists 
an output pole direction yp such that 


L“"(p)yp = 0 (6.6 


Since T is stable, it has no RHP-pole at s = p, so T(p) is finite. It then follows, from 
S=TL~', that S(p)yp =T(p)L~'(p) yp = 0. 


Similar constraints apply to L;, S; and Ty, but these are in terms of the input 
zero and pole directions, uz and up. 


6.2.4 Sensitivity peaks 


Based on the above interpolation constraints we here derive lower bounds on 
the weighted sensitivity functions. The results show that a peak on a(S) larger 
than 1 is unavoidable if the plant has a RHP-zero, and that a peak on a(T) 
larger 1 is unavoidable if the plant has a RHP-pole. In particular, the peaks 
may be large if the plant has both RHP-zeros and RHP-poles. 

The first result, originally due to Zames (1981), directly generalizes the 
SISO condition in (5.16). 


Theorem 6.1 Weighted sensitivity. Suppose the plant G(s) has a RHP- 
zero at s = z. Let wp(s) be any stable scalar weight. Then for closed-loop 
stability the weighted sensitivity function must satisfy 


wr S(s)lloo = maxa(wpS(jw)) > we(2)| (6.7) 


Proof: Introduce the scalar function f(s) = yf wp(s)S(s)y which is analytic in the 
RHP. We then have 


lwPS(s)lloo 2 If (leo 2 |f(2)| = |wr(2)| (6.8) 


The first inequality follows because the singular value measures the maximum gain 
of a matrix independent of direction, and so G(A) > ||Aw||2 and a(A) > ||wAlle 
(see (A.109)) for any vector w with ||w|l2 = 1. The second inequality follows 
from the maximum modulus principle as in the SISO case. The final equality 
follows since wp(s) is a scalar and from the interpolation constraint (6.4) we get 
yz S(z)ye = yz ys = 1. 


The next theorem generalizes the SISO-condition in (5.17). 
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Theorem 6.2 Weighted complementary sensitivity. Suppose the plant 
G(s) has a RHP-pole at s = p. Let wy(s) be any stable scalar weight. Then 
for closed-loop stability the weighted complementary sensitivity function must 
satisfy 

wr(s)P(s)[ho0 = max e(wrT (jx)) > hor (P) (6.9) 


Proof: Introduce the scalar function f(s) = yj! wr(s)T(s)yp which is analytic in the 
RHP since wrT(s) is stable. We then have 


lwrT(s)lloo 2 IF (s)lloo 2 |F()| = wr (p)| (6.10) 


The first inequality follows because the singular value measures the maximum gain 
of a matrix independent of direction and ||yp||2 = 1. The second inequality follows 
from the maximum modulus principle. The final equality follows since wr(s) is a 
scalar and from (6.5) we get y//T(p)yp = Yp Yp = 1. 


The third theorem, which is a direct extension of the SISO result in (5.18), 
generalizes the two above theorems. Consider a plant G(s) with RHP-poles 
p; and RHP-zeros z;. and factorize G(s) in terms of the Blaschke products as 
follows 


G(s) = Bp(s)Gp(s), G(s) = By'(s)Gz(s) 


where B,(s) and B,(s) are stable all-pass transfer matrices (all singular values 
are 1 for s = jw) containing the RHP-poles and RHP-zeros, respectively. By(s) 
is obtained by factorizing to the output one RHP-pole at a time, starting with 
G(s) = By1(s)Gpi(s) where Byi(s) = 1+ 2Reps Tidy Where Fp1 = Ypr is the 
output pole direction for p,. This procedure may be continued to factor out 
po from G'pi(s) where {pe is the output pole direction of Gp, (which need not 
coincide with yp2, the pole direction of Gp), and so on. A similar procedure 


may be used for the RHP-zeros. We get 


Remark. State-space realizations are provided by Zhou et al. (1996, p.145). Note 
that the realization may be complex. 


With this factorization we have the following theorem. 


Theorem 6.3 MIMO sensitivity peak. Suppose that G(s) has N, RHP- 
zeros z; with output directions y.;, and has N, RHP-poles p; with output 
directions Ypi. Define the all-pass transfer matrices given in (6.11) and 
compute the real constants 


1 = IwBp(zA)lla 21; eo1 = [1B (pi)ypalle > 1 (6.12) 
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Then for closed-loop stability the weighted sensitivity function must satisfy for 
each 2; 


|wPS|loo > ery |we(z;)| (6.13) 
and the weighted complementary sensitivity function must satisfy for each p;: 
|wrT loo = cri \wr(pa)| (6.14) 


Proof of ci; in (6.12): Consider here a RHP-zero z with direction y, (the subscript 
j is omitted). Since G has RHP-poles at p;, S must have RHP-zeros at p;, such that 
T = SGK is stable. We may factorize S = TL~' = $,B,'(s) and introduce the 
scalar function f(s) = yf’ wp(s)S1(s)y which is analytic (stable) in the RHP. y is a 
vector of unit length which is free to choose. We then have 


||wPS(8)|loo = [wr Silloo > IIf(S)lloo > |f(2)| = lwr(2)|-lyz Bp(z)y| (6.15) 


The final equality follows since w, is a scalar and yz’ Si(z) = yf S(z)B)(z) = 
y# B,(z). We finally select y such that the lower bound is as large as possible and 
derive ci. To prove that c; > 1, we follow Chen (1995) and introduce the matrix 
V; whose columns together with Ypi form an orthonormal basis for C!*!. Then, 
I= Gpitigs + ViVi", and 


as mi stpi Fant 2 
Bpi(s) = I+ Vip: = Goi + ViVE = [Gp Ve) | re | | Bee 
'p S— Di Pidpr S—p Ptspi a ip 0 T V; 


and we see that all singular values of By;(z) are equal to 1, except for one which is 
\z + pi|/|z — pi| > 1 (since z and p; are both in the RHP). Thus all singular values 
of B,(z) are 1 or larger, so B,(z) is larger or equal to 1 in all directions and c; > 1. 
The proof of co; is similar. 


From Theorem 6.3 we get by selecting wp(s) = 1 and w7(s) = 1 


|S loo > max cj; ||T loo > max C25 (6.17 
zeros 2; poles p; 
Thus, a peak for a(S(jw)) and o(T'(jw)) larger than 1 is unavoidable if the 
plant has both a RHP-pole and a RHP-zero (unless their relative angle is 90°). 
One RHP-pole and RHP-zero. For the case with one RHP-zero z and 
one RHP-pole p we derive from (6.12) 


pop TPP 5 
= —+ ah 
a=a ‘a o+ z= pe cos? d (6.18) 
where ¢ = arccos|y/yp| is the angle between the output directions for the 
pole and zero. We then get that if the pole and zero are aligned in the same 
direction such that y = yp and ¢ = 0, then (6.18) simplifies to the SISO- 


conditions in (5.18) and (5.19) with ¢ = c@ = =e > 1. Conversely, if 


the pole and zero are aligned orthogonally to each other then @ = 90° and 
C = C, = 1 and there is no additional penalty for having both a RHP-pole 
and a RHP-zero. 
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Proof of (6.18): From (6.12) c1 = ||y#’ Bp(z)||2. From (6.16) the projection of y, in 
the direction of the largest singular value of Bp(z) has magnitude |z+ |/|z—p| cosd, 
and the projection onto the remaining subspace is 1- sin ¢, and (6.18) follows. The 
result was first proved by Boyd and Desoer (1985) and an alternative proof is given 
in Chen (1995) who presents a slightly improved bound. 


Below we discuss the implications of these results and provide some 
examples. 


6.3. Functional controllability 


Consider a plant G(s) with | outputs and let r denote the normal rank of 
G(s). In order to control all outputs independently we must require r = I, 
that is, the plant must be functionally controllable. This term was introduced 
by Rosenbrock (1970, p. 70) for square systems, and related concepts are “right 
invertibility” and “output realizability”. We will use the following definition: 


Definition 6.4 Functional controllability. An m-input l-output system 
G(s) is functionally controllable if the normal rank of G(s), denoted r, is 
equal to the number of outputs, |, that is, if G(s) has full row rank. A system 
is functionally uncontrollable if r < 1. 


The normal rank of G(s) is the rank of G(s) at all values of s except 
at a finite number of singularities (which are the zeros of G(s)). The only 
example of a SISO system which is functionally uncontrollable is the system 
G(s) = 0. A square MIMO system is functional uncontrollable if and only if 
det G(s) = 0,Vs. 

A plant is functional uncontrollable if (and only if) o(G(jw)) = 0, Vw. As 
a measure of how close a plant is to being functional uncontrollable we may 
therefore consider o;(G(jw)) (which for the interesting case with as least as 
many inputs as outputs, m > 1, is the minimum singular value, ¢(G(jw))). 

In most cases functional uncontrollability is a structural property of the 
system, that is, it does not depend on specific parameter values, and it may 
often be evaluated from cause-and-effect graphs. A typical example of this is 
when none of the inputs u,; affect a particular output y; which would be the 
case if one of the rows in G(s) was identically zero. Another example is when 
there are fewer inputs than outputs. 

For strictly proper systems, G(s) = C(sI — A)~1B, we have that G(s) is 
functionally uncontrollable if rank(B) <1 (the system is input deficient), or 
if rank(C) <1 (the system is output deficient), or if rank(sI — A) <1 (fewer 
states than outputs). This follows since the rank of a product of matrices is 
less than or equal to the minimum rank of the individual matrices, see (A.34). 


234 MULTIVARIABLE FEEDBACK CONTROL 


If the plant is not functional controllable, i.e. r < 1, then there are 1 —r 
output directions, denoted yo, which cannot be affected. These directions will 
vary with frequency, and we have (analogous to the concept of a zero direction) 


Yo (jw)G (jw) = 0 (6.19) 


From an SVD of G(jw) = UV", the uncontrollable output directions yo(jw) 
are the last 1 — r columns of U(jw). By analyzing these directions, an 
engineer can then decide on whether it is acceptable to keep certain output 


combinations uncontrolled, or if additional actuators are needed to increase 
the rank of G(s). 


Example 6.1 The following plant is singular and thus not functionally controllable 


G(s) = Ee Zul 


842 842 


This is easily seen since column 2 of G(s) is two times column 1. The uncontrollable 
output directions at low and high frequencies are, respectively 


yo(0) = reel yo(co) = wae 


6.4 Limitations imposed by time delays 


Time delays pose limitations also in MIMO systems. Specifically, let 6;; denote 
the time delay in the ij’th element of G(s). Then a lower bound on the time 
delay for output i is given by the smallest delay in row 7 of G(s), that is, 


min __ : 2 
ap = min bij 


This bound is obvious since 6 is the minimum time for any input to affect 
output 7, and 6™™ can be regarded as a delay pinned to output i. 

Holt and Morari (1985a) have derived additional bounds, but their 
usefulness is sometimes limited since they assume a decoupled closed-loop 
response (which is usually not desirable in terms of overall performance) and 
also assume infinite power in the inputs. 

For MIMO systems we have the surprising result that an increased time 
delay may sometimes improve the achievable performance. As a simple 
example, consider the plant 


(6.20) 


With 6 = 0, the plant is singular (not functionally controllable), and 
controlling the two outputs independently is clearly impossible. On the other 
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hand, for 0 > 0, effective feedback control is possible, provided the bandwidth 
is larger than about 1/0. That is, for this example control is easier the larger 
0 is. To see that this is true, we may compute the condition number (or 
the RGA) of G as a function of frequency, and note that it approaches 1 
at frequencies higher than about 1/6. In words, the presence of the delay 
decouples the initial (high-frequency) response, so we can obtain tight control 
if the controller reacts within this initial time period. 


Exercise 6.1 To illustrate the above arguments, determine the sensitivity function 
S for the plant (6.20) using a simple diagonal controller K = ET, Use the 
approximation e~°* = 1—Os to show that at low frequencies the elements of S(s) are 
of magnitude k/(2k +0) How large must k be to have acceptable performance (less 
than 10% offset at low frequencies)? What is the corresponding bandwidth? (Answer: 
Need k > 8/0. Bandwidth is equal to k.) 


Remark. The observant reader may have noted that G(s) in (6.20) is singular at 
s = 0 (even with # nonzero) and thus has a zero at s = 0. Therefore, a controller 
with integral action which cancels this zero, yields an internally unstable system, 
(e.g. the transfer function KS contains an integrator). This means that although 
the conclusion that the time delay helps is correct, the derivations given in Exercise 
6.1 are not strictly correct. To “fix” the results we may assume that the plant is 
only going to be controlled over a limited time so that internal instability and input 
saturation are not issues. Alternatively, we may assume, for example, that e~** is 
replaced by 0.99e~** so that the plant is not singular at steady-state (but it is close 
to singular). 

Exercise 6.2 Repeat Exercise 6.1 with e~°* replaced by 0.99 aes ; plot the 
elements of S(jw) as a function of frequency for k = 0.1/0, k =1/0 andk =8/0. 


6.5 Limitations imposed by RHP-zeros 


RHP-zeros are common in many practical multivariable problems. The 
limitations they impose are similar to those for SISO systems, although often 
not quite so serious as they only apply in particular directions. 

For ideal ISE-optimal control (the “cheap” LQR problem), the SISO 
results in (5.25) and (5.26) can be generalized, see Qiu and Davison (1993). 
Specifically, they show for a MIMO plant with RHP-zeros at z; that the ideal 
ISE-value (the “cheap” LQR cost function) for a step disturbance or reference 
is directly related to 57, 2/z;. Thus, as for SISO systems, RHP-zeros close to 
the origin imply poor control performance. 

The limitations of a RHP-zero located at z may also be derived from the 
bound 

||wPS(s) loo = max |wp(Gw)|o(SGw)) = |wr(2)| (6.21) 
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in (6.7) where wp(s) is a scalar weight. All the results derived in Section 5.6.4 
for SISO systems, therefore generalize if we consider the “worst” direction 
corresponding to the maximum singular value, a(S). For instance, by selecting 
the weight wp(s) such that we require tight control at low frequencies and a 
peak for a(S) less than 2, we derive from (5.37) that the bandwidth (in the 
“worst” direction) must for a real RHP-zero satisfy wR < z/2. Alternatively, if 
we require tight control at high frequencies, then we must from (5.41) satisfy 
WR > 2z. 


Remark 1 The use of a scalar weight wp(s) in (6.21) is somewhat restrictive. 
However, the assumption is less restrictive if one follows the scaling procedure 
in Section 1.4 and scales all outputs by their allowed variations such that their 
magnitudes are of approximately equal importance. 


Remark 2 Note that condition (6.21) involves the maximum singular value (which 
is associated with the “worst” direction), and therefore the RHP-zero may not be a 
limitation in other directions. Furthermore, we may to some extent choose the worst 
direction. This is discussed next. 


6.5.1 Moving the effect of a RHP-zero to a specific 
output 


In MIMO systems one can often move the deteriorating effect of a RHP-zero 
to a given output, which may be less important to control well. This is possible 
because, although the interpolation constraint y# T(z) = 0 imposes a certain 
relationship between the elements within each column of T(s), the columns of 
T(s) may still be selected independently. Let us first consider an example to 
motivate the results that follow. Most of the results in this section are from 
Holt and Morari (1985b) where further extensions can also be found. 


Example 6.2 Consider the plant 


1 iv. a 
Gs) = @as4 ne 41) hes i 


which has a RHP-zero at s = z=0.5. This is the same plant considered in Section 
8.8 where we performed some Ho controller designs. The output zero direction 
satisfies y?G(z) = 0 and we find 


ye= Teel = [ol 


Any allowable T(s) must satisfy the interpolation constraint y T(z) = 0 in (6.4), 
and this imposes the following relationships between the column elements of T(s): 


2t11(z) = toi(z) = 0; 2ti2(z) = too(z) =0 (6.22) 
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We will consider reference tracking y = Tr and examine three possible choices for 
T: To diagonal (a decoupled design), T, with output 1 perfectly controlled, and T2 
with output 2 perfectly controlled. Of course, we cannot achieve perfect control in 
practice, but we make the assumption to simplify our argument. In all three cases, 
we require perfect tracking at steady-state, i.e. T(0) =I. 

A decoupled design has t12(s) = toi(s) = 0, and to satisfy (6.22) we then need 
tii(z) = 0 and to2(z) = 0, so the RHP-zero must be contained in both diagonal 
elements. One possible choice, which also satisfies T(0) = I, is 


=stz 0 
To(s) = | o 23 
stz 


For the two designs with one output perfectly controlled we choose 


1 0 —s+z Bas 
nio=[a: te] mO= [HE AE] 
ste s+z 
The basis for the last two selections is as follows. For the output which is not perfectly 
controlled, the diagonal element must have a RHP-zero to satisfy (6.22), and the off- 
diagonal element must have an s term in the numerator to give T(0) = I. To satisfy 

(6.22), we must then require for the two designs 


The RHP-zero has no effect on output 1 for design T;(s), and no effect on output 2 
for design T2(s). We therefore see that it is indeed possible to move the effect of the 
RHP-zero to a particular output. However, we must pay for this by having to accept 
some interaction. We note that the magnitude of the interaction, as expressed by Gx, 
is largest for the case where output 1 is perfectly controlled. This is reasonable since 
the zero output direction y. = [0.89 —0.45]" is mainly in the direction of output 
1, so we have to “pay more” to push its effect to output 2. This was also observed 
in the controller designs in Section 3.8, see Figure 3.10. 


We see from the above example that by requiring a decoupled response from 
r to y, as in design To(s), we have to accept that the multivariable RHP-zero 
appears as a RHP-zero in each of the diagonal elements of T(s). In other 
words, requiring a decoupled response generally leads to the introduction of 
additional RHP zeros in T(s) which are not present in the plant G(s). 

We also see that we can move the effect of the RHP-zero to a particular 
output, but we then have to accept some interaction. This is stated more 
exactly in the following Theorem. 


Theorem 6.5 Assume that G(s) is square, functionally controllable and 
stable and has a single RHP-zero at s = z and no RHP-pole at s = z. Then 
if the k’th element of the output zero direction is nonzero, i.e. yzr #0, tt is 
possible to obtain “perfect” control on all outputs 7 # k with the remaining 
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output exhibiting no steady-state offset. Specifically, T can be chosen of the 
form 


1 0 0 0 0 0 
0 1 0 0 0 0 
T(s) =| Bs Bos |. Br-18 =stz Betis |. Bns (6.23) 
sHz stz sHtz sHz sHz stz 
0 0 0 0 0 1 
where ; 
B; = —2%4 for j #k (6.24 
Yzk 


Proof: It is clear that (6.23) satisfies the interpolation constraint yz/T(z) = 0; see 
also Holt and Morari (1985b). 


The effect of moving completely the effect of a RHP-zero to output & is 
quantified by (6.24). We see that if the zero is not “naturally” aligned with 
this output, ie., if |yz%| is much smaller than 1, then the interactions will 
be significant, in terms of yielding some 6; = —2y.;/y, much larger than 1 
in magnitude. In particular, we cannot move the effect of a RHP-zero to an 
output corresponding to a zero element in y,, which occurs frequently if we 
have a RHP-zero pinned to a subset of the outputs. 


Exercise 6.3 Consider the plant 


1 
G(s) = | Z (6.25) 
aro 
a) Find the zero and its output direction. (Answer: z = a —landy, = 


[-a 1]"). 

b) Which values of a yield a RHP-zero, and which of these values is best/worst in 
terms of achievable performance? (Answer: We have a RHP-zero for |a| < 1. Best 
fora =0 with zero at infinity; if control at steady-state required then worst fora =1 
with zero at s =0.) 

c) Suppose a = 0.1. Which output is the most difficult to control? Illustrate your 
conclusion using Theorem 6.5. (Answer: Output 2 is the most difficult since the 
zero is mainly in that direction; we get interaction 3 = 20 if we want to control y2 


perfectly.) 


Exercise 6.4 Repeat the above exercise for the plant: 


Bole a ee a (O28) 
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6.6 Limitations imposed by RHP-poles 


From the bound ||wr(s)T(s) |loo. > |wr(p)| in (6.9) we find that a RHP-pole 
p imposes restrictions on ¢(T’) which are identical to those derived on |T| for 
SISO systems in Section 5.8. Thus, we need feedback to stabilize an unstable 
plant and must require that o(T (jw) is about 1 or larger up to the frequency 


2|p|, approximately. 


6.7 RHP-poles combined with RHP-zeros 


For SISO systems we found that performance is poor if the plant has a RHP- 
pole located close to a RHP-zero. This is also the case in MIMO systems, 
provided also the directions coincide. This was quantified in Theorem 6.3. For 
example, for a MIMO plant with single RHP-zero z and single RHP-pole p 
we derive from (6.18) and (6.17) 


|z + pP 


lz — pe cos? d (6.27) 


[Slo 26 [ITllo 2G ¢= yates 


where ¢ = arccos|y// yp| is the angle between the RHP-zero and RHP-pole. We 
next consider an example which demonstrates the importance of the directions 
as expressed by the angle @. 


Example 6.3 Consider the plant 


1 . s—z 
0 cosa —sina 0 
= s—p O.1s+1 $ (es a 
G.(s) = ; ; be a j ts |; z=2,p=3 (6.28) 


Va 


which has for all values of a has a RHP-zero at z= 2 and a RHP-pole at p= 3. 
For a = 0° the rotation matrit U. = I, and the plant consists of two decoupled 
subsystems 
sz 0 
Gols) = | O18+DG-p) . 
me | 0 oniticn | 
Here the subsystem gi1 has both a RHP-pole and a RHP-zero, and closed-loop 
performance is expected to be poor. On the other hand, there are no particular control 
problems related to the subsystem goo. Next, consider a = 90° for which we have 


Bs 0 s+z 
U, = Ei ‘4 , sand Gests) = oe | 
~ Wiest) v 


and we again have two decoupled subsystems, but this time in the off-diagonal 
elements. The main difference, however, is that there is no interaction between the 
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RHP-pole and RHP-zero in this case, so we expect this plant to be easier to control. 
For intermediate values of a we do not have decoupled subsystems, and there will be 
some interaction between the RHP-pole and RHP-zero. 

Since in (6.28) the RHP-pole is located at the output of the plant, its output 
direction is fired and we find yp =[1 0)" for all values of a. On the other hand, 
the RHP-zero output direction changes from [1 O]’ for a = 0° to [0 1)" for 
a = 90°. Thus, the angle ¢ between the pole and zero direction also varies between 
0° and 90°, but ¢ and a are not equal. This is seen from the Table below, where we 
also give c in (combined{2a), for four rotation angles, a = 0°,30°,60° and 90°. 


a 0° 30° 60° 90° 


Ys Pees 


6 = cos! ly yp] | 0° 70.9° 83.4° | 90° 


c 5.0 1.89 1.15 1.0 
|S|loo 7.00 2.60 1.59 1.98 
IT Ilo 7.40 2.76 1.60 1.31 
y(S/KS) 9.55 3.53 2.01 1.59 
phi=0 phi = 70.9 
25 2 
| 34 
| cae 
1 ae 
Or letce OF Steak | 
1 Se laaan es AGn if! 
rr 2 3 4 5 Py 1 2 3 4 5 
phi = 83.4 phi = 90 
2 2 
1}-~ 1 
OS. bee. i 
1 Sate 1 Re eee ieee 
~* 1. 2 3 4 5 ~* 2 3 4 5 
Time Time 


Figure 6.1: MIMO plant with angle ¢ between RHP-pole and RHP-zero. Response 
to step in reference with 7H. controller for four different values of ¢. Solid line: y1; 
Dashed line: y2. 
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The table also shows the values of ||S||o. and ||T ||. obtained by an Hoo optimal 
S/KS design using the following weights 


ae 
Ween we = (2 #8) i a= 2,05 =08 (6.29) 


The weight Wp indicates that we require ||S||oo less than 2, and require tight control 
up to a frequency of about wR = 0.5rad/s. The minimum Hoo norm for the overall 
S/KS problem is given by the value of y in Table 6.8. The corresponding responses 
to a step change in the reference, r=[1 —1], are shown in Figure 6.1. 

Several things about the erample are worth noting: 


1. We see from the simulation for ¢ = a =0° in Figure 6.1 that the response for 
yi is very poor. This is as expected because of the closeness of the RHP-pole and 
zero (z = 2,p = 3). 

2. The bound c on ||S|lo in (6.27) is tight in this case. This can be shown 
numerically by selecting W, = 0.017, we = 0.01 and M = 1 (Wy and 
wp are small so the main objective is to minimize the peak of S). We find 
with these weights that the Hoo designs for the four angles yield ||S|lo = 
5.04, 1.905, 1.155, 1.005, which are very close to c. 

8. The angle ¢ between the pole and zero, is quite different from the rotation angle 
a at intermediate values between 0° and 90°. This is because of the influence of 
the RHP-pole in output 1, which yields a strong gain in this direction, and thus 
tends to push the zero direction towards output 2. 

4. Fora =0° we havec=5 s0 ||S|loo > 5 and ||T|loo > 5 and it is clearly impossible 
to get ||S|loo less than 2, as required by the performance weight Wp. 

5. The Hoo optimal controller is unstable for a =0° and 30°. This is not altogether 
surprising, because for a =0° the plant becomes two SISO systems one of which 
needs an unstable controller to stabilize it since p > z (see Section 5.9). 


In conclusion, pole and zero directions provide useful information about 
a plant, as does the values of c in (6.27). However, the output pole and 
zero directions do depend on the relative scaling of the outputs, which must 
therefore be done appropriately prior to any analysis. 


6.8 Performance requirements imposed by 
disturbances 
For SISO systems we found that large and “fast” disturbances require tight 


control and a large bandwidth. The same results apply to MIMO systems, but 
again the issue of directions is important. 


Definition 6.6 Disturbance direction. Consider a single (scalar) distur- 
bance and let the vector ga represent its effect on the outputs (y = gad), The 
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disturbance direction is defined as 


1 
Yad = —_- Gy (6.30) 
IIgall2 
The associated disturbance condition number is defined as 
ya(G@) = a(G) a(Gtya) (6.31) 


Here Gt is the pseudo-inverse which is G~! for a non-singular G. 


Remark. We here use gq (rather than Ga) to show that we consider a single 
disturbance. For a plant with many disturbances gq corresponds to a given column 
of the matrix Ga. 


The disturbance condition number provides a measure of how a disturbance 
is aligned with the plant. It may vary between 1 (for ya = @) if the disturbance 
is in the “good” direction, and the condition number 7(G) = 4(G)a(G") (for 
Ya = u) if it is in the “bad” direction. Here @ and u are the output directions 
in which the plant has its largest and smallest gain; see Chapter 3. 

In the following, let r = Oand assume that the disturbance has been scaled 
such that at each frequency the worst-case disturbance may be selected as 
|d(w)| = 1. Also assume that the outputs have been scaled such that the 
performance objective is that at each frequency the 2-norm of the error should 
be less than 1, i.e. ||e(w)||2 < 1. With feedback control e = Sggd and the 
performance objective is then satisfied if 


I|Sgall2 = o(Sga)<1Ve <<  |]Sgulloo <1 (6.32) 


For SISO systems, we used this to derive tight bounds on the sensitivity 
function and the loop gain; |S| < 1/|Ga| and |1+ Z| > |Ga|. A similar 
derivation is complicated for MIMO systems because of directions. To see 
this, we can use (6.30) to get the following requirement, which is equivalent 
to (6.32), 

I|Syall2 < 1/llgalle Yo (6.33) 


which shows that the S must be less than 1/||ga||2 only in the direction of yg. 
We can also derive bounds in terms of the singular values of S. Since gq is a 
vector we have from (3.40) 


a(S) ||gall2 < Sgalle < o(5S)|Igall2 (6.34) 
Now a(S) = 1/a(I + L) and a(S) = 1/a(I + L), and we therefore have: 


e For acceptable performance (||Sgall2 < 1) we must at least require that 
o(I +L) is larger than ||gall2 and we may have to require that o(I + L) is 
larger than ||galla. 
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Plant with RHP-zero. If G(s) has a RHP-zero at s = z then the 
performance may be poor if the disturbance is aligned with the output 


direction of this zero. To see this apply the maximum modulus principle to 
f(s) =y! SG to get 


ISgalloo > lyf ga(z)| = lye’ yal - Iga) lle (6.35) 


To satisfy ||Sgalloo < 1 we must then for a given disturbance d at least require 


ly ga(z)| <1 (6.36) 


where yz is the direction of the RHP-zero. This provides a generalization of 
the SISO-condition |Ga(z)| <1 in (5.56). 


Remark. In the above development we consider at each frequency performance 
in terms of |lell2 (the 2-norm). However, the scaling procedure presented in 
Section 1.4 leads naturally to the vector max-norm as the way to measure signals 
and performance. Fortunately, this difference is not too important, and we will 
neglect it in the following. The reason is that for an m x 1 vector a we have 
la||max < |lall2 < Vm |lallmax (see (A.92)) so the values of max- and 2-norms 
are at most a factor \/m apart. 


Example 6.4 Consider the following plant and disturbance models 
1 s—1 4 6 k 
aes) | 45 %o— | » gal) = 5 al pale G8) 


It is assumed that the disturbance and outputs have been appropriately scaled, and 
the question is whether the plant is input-output controllable, i.e. whether we can 
achieve ||Sgalloo <1, for any value of |k| < 1. G(s) has a RHP-zero z = 4 and in 
Example 4.11 on page 144 we have already computed the zero direction. From this 
we get 


ly? Ga(z)| = |[0.83 —0.55]- H | = |0.83k — 0.55| 


and from (6.386) we conclude that the plant is not input-output controllable if 
|0.83k — 0.55| > 1, i.e. if kk < —0.54. We cannot really conclude that the plant 
is controllable for k > —0.54 since (6.36) is only a necessary (and not sufficient) 
condition for acceptable performance, and there may also be other factors that 
determine controllability, such as input constraints which are discussed next. 


Exercise 6.5 Show that the disturbance condition number may be interpreted as 
the ratio between the actual input for disturbance rejection and the input that would 
be needed if the same disturbance was aligned with the “best” plant direction. 
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6.9 Limitations imposed by input constraints 


Manipulated variable constraints can limit the ability to reject disturbances 
and track references. As was done for SISO plants in Chapter 5, we will 
consider the case of perfect control (e = 0) and then of acceptable control 
(\lel| < 1). We derive the results for disturbances, and the corresponding 
results for reference tracking are obtained by replacing Ga by —R. The results 
in this section apply to both feedback and feedforward control. 


Remark. For MIMO systems the choice of vector norm, || - ||, to measure the vector 
signal magnitudes at each frequency makes some difference. The vector max-norm 
(largest element) is the most natural choice when considering input saturation and is 
also the most natural in terms of our scaling procedure. However, for mathematical 
convenience we will also consider the vector 2-norm (Euclidean norm). In most cases 
the difference between these two norms is of little practical significance. 


6.9.1 Inputs for perfect control 


We here consider the question: Can the disturbances be rejected perfectly 
while maintaining ||u|| < 1? To answer this question we must quantify the set 
of possible disturbances and the set of allowed input signals. We will consider 
both the max-norm and 2-norm. 

Max-norm and square plant. For a square plant the input needed 
for perfect disturbance rejection is u = —G~1Gud (as for SISO systems). 
Consider a single disturbance (gq is a vector). Then the worst-case disturbance 
is |d(w)| = 1, and we get that input saturation is avoided (||t||max < 1) if all 


elements in the vector G~1gq are less than 1 in magnitude, that is, 


|G gall max < 1,Vw 


For simultaneous disturbances (Gq is a matrix) the corresponding requirement 
is 


IG" Gallico < 1, Vw (6.38) 


where ||- ||jo0 is the induced maz-norm (induced co-norm, maximum row sum, 
see (A.103)). However, it is usually recommended in a preliminary analysis 
to consider one disturbance at a time, for example, by plotting as a function 
of frequency the individual elements of the matrix G~!Gy. This yields more 
information about which particular input is most likely to saturate and which 
disturbance is the most problematic. 

Two-norm. We here measure both the disturbance and the input in terms 
of the 2norm. Assume that G has full row rank so that the outputs can be 
perfectly controlled. Then the smallest inputs (in terms of the 2-norm) needed 
for perfect disturbance rejection are 


u=—G'Gad (6.39) 
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where Gt = G4 (GG")—? is the pseudo-inverse from (A.62). Then with a 
single disturbance we must require ||Gtga||z < 1. With combined disturbances 
we must require (GtGy) <1, that is, the induced 2-norm is less than 1, see 
(A.104). 

For combined reference changes, ||7(w)||2 < 1, the corresponding condition 
for perfect control with ||u||2 <1 becomes o(GtR) < 1, or equivalently (see 
(A.60)) 

a(R'G) > 1, Ww <u, (6.40) 
where w,. is the frequency up to which reference tracking is required. Usually 
R is diagonal with all elements larger than 1, and we must at least require 


a(G(jw)) > 1,Vw < wp (6.41) 


or, more generally, we want o(G(jw)) large. 


6.9.2 Inputs for acceptable control 


Let r = 0 and consider the response e = Gu + Gad to a disturbance d. The 
question we want to answer in this subsection is: Is it possible to achieve 
llel| < 1 for any ||d|| < 1 using inputs with ||u|| < 1? We use here the max- 
norm, || - ||max (the vector infinity-norm), for the vector signals. 

We consider this problem frequency-by-frequency. This means that we 
neglect the issue of causality which is important for plants with RHP-zeros 
and time delays. The resulting conditions are therefore only necessary (i.e. 
minimum requirements) for achieving |le||max < 1. 


Exact conditions 


Mathematically, the problem can be formulated in several different ways; 
by considering the maximum allowed disturbance, the minimum achievable 
control error or the minimum required input; e.g. see Skogestad and Wolff 
(1992). We here use the latter approach. To simplify the problem, and also to 
provide more insight, we consider one disturbance at a time, i.e. d is a scalar 
and gq a vector. The worst-case disturbance is then |d| = 1 and the problem 
is at each frequency to compute 


Unin min ||| max such that ||Gu + gad||max <1, |d| =1 (6.42) 


A necessary condition for avoiding input saturation (for each disturbance) is 
then 

Umin < 1, Vw (6.43) 
If G and gy are real (i.e. at steady-state) then (6.42) can be formulated as linear 


programming (LP) problem, and in the general case as a convex optimization 
problem. 
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SISO systems. For SISO systems we have an analytical solution. From 
the proof of (5.62) the minimum input for acceptable control (||e(w)||max < 1) 
with a given disturbance is |u| = (|Gad| — 1)/|G|. Thus, to keep |u| < 1 for 
the worst-case disturbance (|d| = 1) we must require 


SISO: |G| > |Ga|—1, at frequencies where |Ga| > 1 (6.44) 


We would like to generalize this result to MIMO systems. Unfortunately, we 
do not have an exact analytical result, but by making the approximation in 
(6.46) below a nice approximate generalization is available. 


Approximate conditions in terms of the SVD 


At each frequency the singular value decomposition of the plant (possibly 
non-square) is G = UNV". Introduce the rotated control error and rotated 
input 

é=U"e, @=Vu (6.45) 


and assume that the max-norm is approximately unchanged by these rotations 
I[Ellmax © |lellmax: — |@#Ilmax * ||tl|max (6.46) 


From (A.122) this would be an equality for the 2-norm, so from (A.92) the 
error by using the approximation for the max-norm is at most a factor ,/m 
where m is the number of elements in the vector. We then find that each 
singular value of G, o;(G), must approximately satisfy 


MIMO: o;(G) > |ulga| —1, at frequencies where |Uf' gal > 1] (6.47) 


where u; is the i’th output singular vector of G, and ga is a vector since we 
consider a single disturbance. More precisely, (6.47) is a necessary condition 
for achieving acceptable control (||e||max < 1) for a single disturbance (|d| = 1) 
with ||u|]max <1, assuming that (6.46) holds. 

Condition (6.47) provides a nice generalization of (6.44). u//gq may be 
interpreted as the projection of gq onto the 7’th output singular vector of 
the plant. 


Proof of (6.47): Let r = 0 and consider the response e = Gu + gad to a single 
disturbance d. We have 


@=U"e =U" (Gut gad) = SG +U" gad (6.48) 


where the last equality follows since U“G = XV. For the worst-case disturbance 
(\d| = 1), we want to find the smallest possible input such that |le||max * ||€llmax 
is less than 1. This is equivalent to requiring |é;| < 1,Vi, where from (6.48) 
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€; = 0;(G)yui + U#! gad. Here Note: uj (a vector) is the i’th column of U, whereas 2%; 
(a scalar) is the i’th rotated plant input. This is a scalar problem similar to that for 
the SISO-case in (5.62), and if we assume |uj/ gad| > 1 (otherwise we may simply 
set %; = 0 and achieve |é;| < 1) then the smallest |u;| is achieved when the right 
hand side is “lined up” to make |e;| = 1. Thus, the minimum input is 


\ats| = (\ui" ga| — 1)/o:(G) (6.49 


and (6.47) follows by requiring that ||1|| max & ||%||max is less than 1. 


Based on (6.47) we can find out: 


1. For which disturbances and at which frequencies input constraints may 
cause problems. This may give ideas on which disturbances should be 
reduced, for example by redesign or use of feedforward control. 

2. In which direction 7 the plant gain is too small. By looking at the 
corresponding input singular vector, vj, one can determine which actuators 
should be redesigned (to get more power in certain directions) and by 
looking at the corresponding output singular vector, u;, one can determine 
on which outputs we may have to reduce our performance requirements. 


Several disturbances. For combined disturbances, one requires the i’th 
row sum of U" Gq to be less than o;(G) (at frequencies where the 7’th row 
sum is larger than 1). However, usually we derive more insight by considering 
one disturbance at a time. 

Reference commands. Similar results are derived for references by 
replacing Ga by —R. 


Example 6.5 Distillation process Consider a2x2 plant with two disturbances. 
The appropriately scaled steady-state model is 


(6.50) 


87.8 —86.4 7.88 8.81 
GSue Be ae a ee rl 


This ts a model of a distillation column with product compositions as outputs, reflux 
and boilup as inputs, and feed rate (20% change) and feed composition (20% change) 
as disturbances. The elements in G are scaled by a factor 0.5 compared to (3.43) 
because the allowed input changes are a factor 2 smaller. From an SVD of G we 
have o(G) = 98.6 and a(G) = 0.70. Some immediate observations: 


1. The elements of the matrit Ga are larger than 1 so control is needed to reject 
disturbances. 

2. Since o(G) = 0.7 we are able to perfectly track reference changes of magnitude 
0.7 (in terms of the 2-norm) without reaching input constraints. 

8. The elements inG are about 5 times larger than those in Ga, which suggests that 
there should be no problems with input constraints. On the other hand, o(G) = 0.7 
is much less than the elements in Ga, so input constraints may be an issue after 
all. 
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4. The disturbance condition numbers, ya(G), for the two disturbances, are 11.75 
and 1.48, respectively. This indicates that the direction of disturbance 1 is less 
favourable than that of disturbance 2. 

5. The condition number y(G) = a(G)/a(G) = 141.7 is large, but this does not by 
itself imply control problems. In this case, the large value of the condition number 
is not caused by a small o(G) (which would be a problem), but rather by a large 
a(G). 

We will now analyze whether the disturbance rejection requirements will cause 
input saturation by considering separately the two cases of perfect control (e = 0) 
and acceptable control (||e||max < 1). 

1. Perfect control. The inputs needed for perfect disturbance rejection are u = 
G-'Ga where 


-1, _ |—1.09 —0.009 
ears Ee ee 
We note that perfect rejection of disturbance d, = 1 requires an input u = 


[—1.09 —1.29]" which is larger than 1 in magnitude. Thus, perfect control of 

disturbance 1 is not possible without violating input constraints. However, perfect 

rejection of disturbance dz = 1 is possible as it requires a much smaller input 

u=[-0.009 —0.213]". 

2. Approximate result for acceptable control. We will use the approrimate 
requirement (6.47) to evaluate the inputs needed for acceptable control. We have 

U" Gi = be aa o1(G) = 98.6 

1.17 0.11 o2(G) = 0.70 


and the magnitude of each element in the i’th row of U"Ga should be less than 
o:(G)+1 to avoid input constraints. In the high-gain direction this is easily satisfied 
since 14.08 and 14.24 are both much less than o1(G) +1 = 99.6, and from (6.49) 
the required input in this direction is thus only about \ui| = (14 — 1)/98.6 = 0.13 
for both disturbances which is much less than 1. The requirement is also satisfied 
in the low-gain direction since 1.17 and 0.11 are both less than o2(G) + 1 = 1.7, 
but we note that the margin is relatively small for disturbance 1. More precisely, in 
the low-gain direction disturbance 1 requires an input magnitude of approximately 
\u2| = (1.17 — 1)/0.7 = 0.24, whereas disturbance 2 requires no control (as its effect 
is 0.11 which is less than 1). 

In conclusion, we find that the results based on perfect control are misleading, 
as acceptable control is indeed possible. Again we find disturbance 1 to be more 
difficult, but the difference is much smaller than with perfect control. The reason is 
that we only need to reject about 12% (1.13—1/1.13) of disturbance 1 in the low-gain 
direction. 

However, this changes drastically if disturbance 1 is larger, since then a much 
larger fraction of it must be rejected. The fact that disturbance 1 is more difficult is 
confirmed in Section 10.11 on page 474 where we also present closed-loop responses. 

3. Exact numerical result for acceptable control. The eract value of the minimum 
inputs needed to achieve acceptable control are |\ul|max = 0.098 for disturbance 1 
and ||u||max = 0.095 for disturbance 2, which confirms that input saturation is not a 
problem. 
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However, the values of |\u||max © 0.10 indicate that the two disturbances are about 
equally difficult. This seems inconsistent with the above approrimate results, where 
we found disturbance 1 to be more difficult. However, the results are consistent if 
for both disturbances control is only needed in the high-gain direction, for which 
the approzimate results gave the same value of 0.13 for both disturbances. (The 
approximate results indicated that some control was needed for disturbance 1 in the 
low-gain direction, since 1.17 was just above 1, but apparently this is inaccurate). 


The discussion at the end of the example illustrates an advantage with the 
approximate method, since there we can easily see whether we are close to a 
borderline value where control may be needed in some direction. On the other 
hand, no such “warning” is provided by the exact numerical method. 

From the example we conclude that it is difficult to judge, simply by looking 
at the magnitude of the elements in Ga, whether a disturbance is difficult to 
reject or not. In the above example, it would appear from the column vectors of 
Gq in (6.50) that the two disturbances have almost identical effects. However, 
we found that disturbance 1 may be much more difficult to reject because 
it has a component of 1.17 in the low-gain direction of G which is about 10 
times larger than the value of 0.11 for disturbance 2. This can be seen from 
the second row of U" Gy. 


Exercise 6.6 Consider again the plant in (6.87). Let k = 1 and compute, as a 
function of frequency, the required inputs G~'ga(jw) for perfect control. You will 
find that both inputs are about 2 in magnitude at low frequency, so if the inputs and 
disturbances have been appropriately scaled, we conclude that perfect control is not 
possible. Next, evaluate G(jw) = UNV", and compute U" ga(jw) as a function of 
frequency and compare with the elements of U(jw) + I to see whether “acceptable 
control” (\e;(jw)| < 1) is possible. 


6.10 Limitations imposed by uncertainty 


The presence of uncertainty requires the use of feedback, rather than simply 
feedforward control, to get acceptable performance. This sensitivity reduction 
with respect to uncertainty is achieved with high-gain feedback, but for any 
real system we have a crossover frequency range where the loop gain has to 
drop below 1, and the presence of uncertainty in this frequency range may 
result in poor performance or even instability. These issues are the same for 
SISO and MIMO systems. 

However, with MIMO systems there is an additional problem in that there is 
also uncertainty associated with the plant directionality. The main objective of 
this section is to introduce some simple tools, like the RGA and the condition 
number, which are useful in indicating for which plants one might expect 
sensitivity to directional uncertainty. 
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Remark. In Chapter 8 we discuss more exact methods for analyzing performance 
with almost any kind of uncertainty and a given controller. This involves analyzing 
the robust performance by use of the structured singular value. However, in this 
section the treatment is kept at a more elementary level and we are looking for 
results which depend on the plant only. 


6.10.1 Input and output uncertainty 


In practice the difference between the true perturbed plant G’ and the plant 
model G is caused by a number of different sources. In this section, we focus 
on input uncertainty and output uncertainty. In a multiplicative (relative) 
form, the output and input uncertainties (as in Figure 6.2) are given by 


Output uncertainty: G’=(I+Eo)G or Eg =(G'-G)G! (6.51) 
Input uncertainty: G!' =G(I+E;) or Er=G4(G'—G)_ (6.52) 


These forms of uncertainty may seem similar, but we will show that their 
implications for control may be very different. If all the elements in the 
matrices E; or Eo are non-zero, then we have full block (“unstructured”) 
uncertainty. However, in many cases the source of uncertainty is in the 
individual input or output channels, and we have that E; or Eo are diagonal 
matrices, for example, 


Er => diag{e1, €25-- By (6.53) 


It is important to stress that this diagonal diagonal output uncertainty. input 


Figure 6.2: Plant with multiplicative input and output uncertainty 


uncertainty is always present in real systems. 


6.10.2 Effect of uncertainty on feedforward control 


Consider a feedforward controller u = K,r for the case with no disturbances 
(d = 0). We assume that the plant G is invertible such that we can select 
K, = G7! and achieve perfect control, e = y-—r = Gu —r = GK,r —r = 0, 
for the nominal case with no uncertainty. However, for the actual plant G’ 
(with uncertainty) the actual control error becomes e! = y'—r = G'G"'r—r. 
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We then get for the two sources of uncertainty 


Output uncertainty: e’ = Eor (6.54) 
Input uncertainty: e! = GE;G tr (6.55) 


For output uncertainty, we see that (6.54) is identical to the result in (5.73) 
for SISO systems. That is, the worst-case relative control error |le’||2/||r||2 is 
equal to the magnitude of the relative output uncertainty ¢(Eo). However, for 
input uncertainty the sensitivity may be much larger because the elements in 
the matrix GE;G~' can be much larger than the elements in Ey. In particular, 
for diagonal input uncertainty the elements of GE;G~1 are directly related 
to the RGA, see (A.79): 


Diagonal uncertainty: [GE;G~‘],, = So di Me; (6.56) 


aa 


The RGA-matrix is scaling independent, which makes the use of condition 
(6.56) attractive. Since diagonal input uncertainty is always present we can 
conclude that 


e if the plant has large RGA elements within the frequency range where 
effective control is desired, then it is not possible to achieve good reference 
tracking with feedforward control because of strong sensitivity to diagonal 
input uncertainty. 


The reverse statement is not true, that is, if the RGA has small elements 
we cannot conclude that the sensitivity to input uncertainty is small. This is 
seen from the following expression for the 2 x 2 case 


r r — 22) _ 
GE;G-! = 1141 + A12€2 oa 11(€1 — €2) (6.57) 


921 
TAIT el —€2) Aa1€1 + Ag2€2 


For example, consider a triangular plant with gi2 = 0. In this case the RGA 
is A =I so the diagonal elements of GE;G~' are e, and eg. Still, the system 
may be sensitive to input uncertainty, since from (6.57) the 2,1-element of 
GE;G~' may be large if goi/gi1 is large. 


6.10.3 Uncertainty and the benefits of feedback 


To illustrate the benefits of feedback control in reducing the sensitivity to 
uncertainty, we consider the effect of output uncertainty on reference tracking. 
As a basis for comparison we first consider feedforward control. 
Feedforward control. Let the nominal transfer function with feedforward 
control be y = T,r where T, = GK, and K, denotes the feedforward 
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controller. Ideally, JT, = I. With model error T/ = G’K;, and the change 
in response is y’ — y = (T/ — T,.)r where 


T. —T, =(G' —G)G"'T, = EoT, (6.58) 


Thus, y'—y = EoT,r = Eoy, and with feedforward control the relative control 
error caused by the uncertainty is equal to the relative output uncertainty. 

Feedback control. With one degree-of-freedom feedback control the 
nominal transfer function is y = Tr where T = L(I + L)~! is the 
complementary sensitivity function. Ideally, T = I. The change in response 
with model error is y’ — y = (T’ — T)r where from (4.143) 


T'—-T=S'EoT (6.59) 
Thus, y’ —y = S'EoTr = S'Eoy, and we see that 


e with feedback control the effect of the uncertainty is reduced by a factor S’ 
relative to that with feedforward control. 


Thus, feedback control is much less sensitive to uncertainty than feedforward 
control at frequencies where feedback is effective and the elements in S’ are 
small. (However, the opposite may be true in the crossover frequency range 
where S’ may have elements larger than 1, see Section 6.10.4.) 


Remark 1 For square plants, Eo = (G' — G)G™! and (6.59) becomes 
AT -T7'=S'.AG-G"! (6.60) 


where AT = T' — T and AG = G' —G. Equation (6.60) provides a generalization 
of Bode’s differential relationship (2.23) for SISO systems. To see this, consider a 
SISO system and let AG — 0. Then S’ — S$ and we have from (6.60) 


ipa (6.61) 


Remark 2 Alternative expressions showing the benefits of feedback control are 
derived by introducing the inverse output multiplicative uncertainty G’ = (I — 
Eio)~'G. We then get (Horowitz and Shaked, 1975). 


Feedforward control: T. —Tp = EioT, (6.62) 
Feedback control: T' —T = SEioT' (6.63) 


(Simple proof for square plants: switch G and G" in (6.58) and (6.59) and use 
Eio = (G' — G)G"" ) 


Remark 3 Another form of (6.59) is (Zames, 1981) 


T’-T=S'(L'-L)s (6.64) 


LIMITATIONS IN MIMO SYSTEMS 253 


Conclusion. From (6.59), (6.63) and (6.64) we see that with feedback control 
T' — T is small at frequencies where feedback is effective (i.e. S and 9S’ are 
small). This is usually at low frequencies. At higher frequencies we have for 
real systems that D is small, so T is small, and again T’ — T is small. Thus 
with feedback, uncertainty only has a significant effect in the crossover region 
where S and T both have norms around 1. 


6.10.4 Uncertainty and the sensitivity peak 


We demonstrated above how feedback may reduce the effect of uncertainty, 
but we also pointed out that uncertainty may pose limitations on achievable 
performance, especially at crossover frequencies. The objective in the following 
is to investigate how the magnitude of the sensitivity, a(S’), is affected by the 
multiplicative output uncertainty and input uncertainty given as (6.51) and 
(6.52). We will derive upper bounds on a(S’) which involve the plant and 
controller condition numbers 

a(x) 

VG) = a(G)’ VK) = o(K) (6.65) 


and the following minimized condition numbers of the plant and the controller 
w(G) = min ¥(GDi), — yo(K) = min y(Dok) (6.66) 


where D; and Do are diagonal scaling matrices. These minimized condition 
numbers may be computed using (A.73) and (A.74). Similarly, we state a lower 
bound on a(S’) for an inverse-based controller in terms of the RGA-matrix of 
the plant. 

The following factorization, of S” in terms of the nominal sensitivity S (see 
Appendix A.6) form the basis for the development: 


Output uncertainty: S'=S(I1+ EoT)! (6.67) 
Input uncertainty: S'=S(14+GE;G"'T)"' = SG(I+ ErTr)~'G~' (6.68) 
S'’=(14+TK'E;K)7'S = K7'(14+ T;Er)' KS (6.69) 


We assume that G and G’ are stable. We also assume closed-loop stability, so 
that both $ and S$" are stable. We then get that (I+ EoT)~!' and (I+ E;T;)~! 
are stable. In most cases we assume that the magnitude of the multiplicative 
(relative) uncertainty at each frequency can be bounded in terms of its singular 
value 


a(Er) < |wyl, a(Eo) < |wol (6.70) 


where w7(s) and wo(s) are scalar weights. Typically the uncertainty bound, 
|wr| or |wo|, is 0.2 at low frequencies and exceeds 1 at higher frequencies. 
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We first state some upper bounds on a(S’). These are based on identities 
(6.67)-(6.69) and singular value inequalities (see Appendix A.3.4) of the kind 


a(I+2:T)") = samy © meu < mapa © Gea 


Of course these inequalities only apply if we assume o(F;Tr) <_ 1, 
o(E7)a(Tr) < 1 and |w;|a(Ty7) < 1. For simplicity, we will not state these 
assumptions each time. 


Upper bound on a(S’) for output uncertainty 


From (6.67) we derive 


a(S’) <a(S)o(I+BoT)) < —f4bR (6.71) 
From (6.71) we see that output uncertainty, be it diagonal or full block, poses 
no particular problem when performance is measured at the plant output. 
That is, if we have a reasonable margin to stability (||(I + ZoT)~"||oo is not 
too much larger than 1), then the nominal and perturbed sensitivities do not 
differ very much. 


Upper bounds on a(S’) for input uncertainty 


The sensitivity function can be much more sensitive to input uncertainty than 
output uncertainty. 

1. General case (full block or diagonal input uncertainty and any controller). 
From (6.68) and (6.69) we derive 


a(S!) <1(G)a(S)a((U+ ET) ") <@)— 6.72) 
HS") <oK)a(S)a(I+TE1)!) <1K) ee (6.73) 


From (6.73) we have the important result that if we use a “round” controller, 
meaning that y(K) is close to 1, then the sensitivity function is not sensitive to 
input uncertainty. In many cases (6.72) and (6.73) are not very useful because 
they yield unnecessarily large upper bounds. To improve on this we present 
below some bounds for special cases, where we either restrict the uncertainty 
to be diagonal or restrict the controller to be of a particular form. 

2. Diagonal uncertainty and decoupling controller. Consider a 
decoupling controller in the form K(s) = D(s)G~1(s) where D(s) is a diagonal 
matrix. In this case, KG is diagonal so T; = KG(I + KG)~ is diagonal, 
and E;T7 is diagonal. The second identity in (6.68) may then be written 
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S' = §(GDr)(I + E;Tr)~-1(GDr)~+ where Dy is freely chosen, and we get 


a(S) <(G)a(S)a(I+ BT) <171@) (6.74) 
a(S") <rB(K)a(S)a(I+ TEV) $7(K) (6.5) 


The bounds (6.74) and (6.75) apply to any decoupling controller in the form 
K = DG"!. In particular, they apply to inverse-based control, D = I(s)I, 
which yields input-output decoupling with 7; = T =t-J where t = ro: 


Remark. A diagonal controller has y4() = 1, so from (6.77) we see that (6.75) 
applies to both a diagonal and decoupling controller. Another bound which does 
apply to any controller is given in (6.77). 


3. Diagonal uncertainty (Any controller). From the first identity in (6.68) 
we get S’ = S(I + (GDr)E;(GD;)!T)~1 and we derive by singular value 
inequalities 


a (5) 
OS) S TH @hw eh 6:6) 
a(S’) ot5) (6.77) 


< TB wa) 


Note that 74 (A) = 1 for a diagonal controller so (6.77) shows that diagonal 
uncertainty does not pose much of a problem when we use decentralized 
control. 


Lower bound on a(S’) for input uncertainty 


Above we derived upper bounds on a(S’); we will next derive a lower bound. 
A lower bound is more useful because tt allows us to make definite conclusions 
about when the plant is not input-output controllable. 


Theorem 6.7 Input uncertainty and inverse-based control. Consider 
a controller K(s) = I(s)G~'(s) which results in a nominally decoupled 
response with sensitivity S = s-I and complementary sensitivity T = t-I where 
t(s) = 1 —s(s). Suppose the plant has diagonal input uncertainty of relative 
magnitude |wr(jw)| in each input channel. Then there exists a combination of 
input uncertainties such that at each frequency 


|wrt| 


a(S’) > a(S) (1+ T+ jor 


IAG) (6.78) 


where ||A(G)|lico is the mazimum row sum of the RGA and a(S) = |s|. 
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The proof is given below. From (6.78) we see that with an inverse based 
controller the worst case sensitivity will be much larger than the nominal at 
frequencies where the plant has large RGA-elements. At frequencies where 
control is effective (|s| is small and |t| ~ 1) this implies that control is not 
as good as expected, but it may still be acceptable. However, at crossover 
frequencies where |s| and |¢| = |1 — s| are both close to 1, we find that a(S’) 
in (6.78) may become much larger than 1 if the plant has large RGA-elements 
at these frequencies. The bound (6.78) applies to diagonal input uncertainty 
and therefore also to full-block input uncertainty (since it is a lower bound). 

Worst-case uncertainty. It is useful to know which combinations of input 
errors give poor performance. For an inverse-based controller a good indicator 
results if we consider GE;G~ 1, where Ey = diag{e,}. If all e, have the same 
magnitude |wy|, then the largest possible magnitude of any diagonal element 
in GE;G"! is given by |wz|- ||A(G)||io00. To obtain this value one may select 
the phase of each e, such that Ze, = —ZA;, where 7 denotes the row of A(G) 
with the largest elements. Also, if A(G) is real (e.g., at steady-state), the signs 
of the e;’s should be the same as those in the row of A(G) with the largest 
elements. 


Proof of Theorem 6.7: (From Skogestad and Havre (1996) and Gjgszeter (1995)). 
Write the sensitivity function as 


S'=(1+G'K)"'=SG(I+ ErTr)"'Gu', Er =diagfe,}, S=sI (6.79) 
———_ 
D 
Since D is a diagonal matrix, we have from (6.56) that the diagonal elements of S’ 
are given in terms of the RGA of the plant G as 


: 1 
sii = 9) Yanda; dy = 2 ASG") (6.80) 
k=1 


14+ te,’ 


(Note that s here is a scalar sensitivity function and not the Laplace variable.) The 
singular value of a matrix is larger than any of its elements, so a(S’) > max; |544|, 
and the objective in the following is to choose a combination of input errors e€, such 
that the worst-case |s/,;| is as large as possible. Consider a given output i and write 
each term in the sum in (6.80) as 


Nik ; Vinten 
1l+te, af 1+ te, 


Nida = (6.81) 


We choose all e, to have the same magnitude |w;(jw)|, so we have ex(jw) = 


jwrlei Ze, We also assume that |te,| < 1 at all frequencies, such that the phase 
of 1+ te, lies between —90° and 90°. ? It is then always possible to select Ze, (the 


2"The assumption |te,| < 1 is not included in the theorem since it is actually needed 
for robust stability, so if it does not hold we may have @(S') infinite for some allowed 
uncertainty, and (6.78) clearly holds. 
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phase of e,) such that the last term in (6.81) is real and negative, and we have at 
each frequency, with these choices for €,, 


344 — a : |Ack| - |tex| 
ae dats 14 a lt + ter] 


3 [Aix | jwrt| |wrt| > 
14 = ri 6.82 
L 1+ furl 1+ |wrt| 2! | (6.82) 


where the first equality makes use of the fact that the row-elements of the RGA 
sum to 1, ()°;_, Ask = 1). The inequality follows since |e,| = |wr| and |1 + tex| < 
1+ |tex] = 1+ |wrt|. This derivation holds for any i (but only for one at a time), 
and (6.78) follows by selecting i to maximize )>;_, |Asn| (the maximum row-sum of 


the RGA of G). 


IV 


We next consider three examples. In the first two, we consider feedforward 
and feedback control of a plant with large RGA-elements. In the third, we 
consider feedback control of a plant with a large condition number, but with 
small RGA-elements. The first two are sensitive to diagonal input uncertainty, 
whereas the third is not. 


Example 6.6 Feedforward control of distillation process. Consider the 
distillation process in (3.77). 


1 87.8 —86.4 35.1 —34.1 
Gus 75s+1 Ee Sel GS ee 35.1 (6-89) 


With Ey = diag{e1,€2} we get for all frequencies 


43.2c; — 43.26. —34.1e, + 35.lee (6.84) 


ane = | Bre 34.le2  —27.7e1 | 
Whe 


We note as expected from (6.57) that the RGA-elements appear on the diagonal 
elements in the matrix GErG~'. The elements in the matrit GErG~' are largest 
when e, and €2 have opposite signs. With a 20% error in each input channel we may 
select €; = 0.2 and e2 = —0.2 and find 


(6.85) 


_; [13.8 -114 
GEG = [in | 


Thus with an “ideal” feedforward controller and 20% input uncertainty, we get from 
(6.55) that the relative tracking error at all frequencies, including steady-state, may 
exceed 1000%. This demonstrates the need for for feedback control. However, this is 
also difficult for this plant as seen in Example 6.7. 


Example 6.7 Feedback control of distillation process. Consider again the 
distillation process G(s) in (6.83) for which we have ||A(G(jw))|lioo = 69.1 and 
V(G) © y7(G) & 141.7 at all frequencies. 


258 MULTIVARIABLE FEEDBACK CONTROL 


1. Inverse based feedback controller. Consider the controller K(s) = 
(0.7/s)G~'(s) corresponding to the nominal sensitivity function 


S(s) = 


ay, 
s+0.7 


The nominal response is excellent, but we found from simulations in Figure 3.12 
that the closed-loop response with 20% input gain uncertainty was extremely poor 
(we used «, = 0.2 and e2 = —0.2). The poor response is easily explained from the 
lower RGA-bound on G(S') in (6.78). With the inverse-based controller we have 
i(s) =k/s which has a nominal phase margin of PM= 90° so from (2.43) we have, 
at frequency we, that |s(jwe)| = |t(jwe)| = 1/V2 = 0.707. With |wr| = 0.2, we then 
get from (6.78) that 


0.707 - 0.2 - 69.1 
1.14 


a(S" (jwe)) > 0.707 (1 ) = 0.707 - 9.56 = 6.76 (6.86) 
(This is close to the peak value in (6.78) of 6.81 at frequency 0.79 rad/min.) Thus, we 
have that with 20% input uncertainty we may have ||S"||o0 > 6.81 and this explains 
the observed poor closed-loop performance. For comparison, the actual peak value 
of a(S’), with the inverse-based controller and uncertainty E; = diag{e1,¢2} = 
diag{0.2, —0.2}, is computed numerically (using skewed-p as discussed below) to be 


-1 
S'loo =H (14 22a f)? Gg =142 
3 0.8 


and occurs at frequency 0.69 rad/min. The difference between the values 6.81 and 
14.2 illustrates that the bound in terms of the RGA is not generally tight, but it is 
nevertheless very useful. 

Neat, we look at the upper bounds. Unfortunately, in this case y;(G) = yo(K) & 
141.7, so the upper bounds on a(S’) in (6.74) and (6.75) are not very tight (they 
are of magnitude 141.7 at high frequencies). 

2. Diagonal (decentralized) feedback controller. Consider the controller 


Kdiag(s) = Baltes) E e ‘ ko = 2.4-107? [min7'] 
8 0 —-1 
The peak value for the upper bound on a(S") in (6.77) is 1.26, 80 we are guaranteed 
|S" loo < 1.26, even with 20% gain uncerainty. For comparison, the actual peak 
in the perturbed sensitivity function with Ey = diag{0.2, —0.2} is ||S"||o = 1.05. Of 
course, the problem with the simple diagonal controller is that (although robust) even 
the nominal response is poor. 


The following example demonstrates that a large plant condition number, 
7(G), does not necessarily imply sensitivity to uncertainty even with an 
inverse-based controller. 


Example 6.8 Feedback control of distillation process, DV-model. [n this 
example we consider the following distillation model given by Skogestad, Morari and 
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Doyle (1988) (it is the same system as studied above but with the DV- rather than 
the LV-configuration for the lower control levels, see Example 10.6): 


1 ee ale (a= Be aaa (6.87) 


~ 75s+1|—108.2 —1.4 0.552 0.448 


We have that ||A(G(jw))|lioo = 1, y(G) © 70.76 and y7(G) & 1.11 at all frequencies. 
Since both the RGA-elements and y7(G) are small we do not expect problems 
with input uncertainty and an inverse based controller. Consider an inverse-based 
controller Kinv(s) = (0.7/s)G~'(s) which yields y(K) =7(G) and ¥6(K) = 77(G). 
In Figure 6.8 we show the lower bound (6.78) given in terms of ||Allicoc and the two 
upper bounds (6.74) and (6.76) given in terms of y;(G) for two different uncertainty 
weights wr. From these curves we see that the bounds are close, and we conclude 
that the plant in (6.87) is robust against input uncertainty even with a decoupling 
controller. 


U1 
2 2 
vo vo 
3 3 
re U2 3 #1 U2 
& & 
a ai & 
sl Tq sl 
-1 0 1 2. -1 0 1 2 
10 10 10 10 10 10 10 10 
Frequency [rad/min] Frequency [rad/min] 
(a) W7(s) = 0.27 (b) Wr(s) = 0.278441 


Figure 6.3: Bounds on the sensitivity function for the distillation column with the 
DV configuration: lower bound L; from (6.78), upper bounds U; from (6.76) and 
U2 from (6.74). 


Remark. Relationship with the structured singular value: skewed-y. To 
analyze exactly the worst-case sensitivity with a given uncertainty |w;| we may 
compute skewed-y (4°). With reference to Section 8.8, this involves computing 
Bx(N) with A= diag(A;, Ap) and N = [eae | and varying p° until 
u(N) = 1. The worst-case performance at a given frequency is then a(S’) = y°(N). 


Example 6.9 Consider the plant 


elt E | 


for which at all frequencies A(G) = I, y(G) = 10*, y*(G@) = 1.00 and y7(G) = 200. 
The RGA-matrix is the identity, but from the large value of yj(G) and (6.57) we 
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know that this plant will be sensitive to diagonal input uncertainty if we use inverse- 
based feedback control, K = qt. This is confirmed if we compute the worst-case 
sensitivity function S’ for G' =G(I+wrAr) where Ar is diagonal and |\wr| = 0.2. 
We find by computing p°(N1) that the peak of a(S’) is |S" loo = 20.43. 

Note that the peak is independent of the controller gain c in this case since G(s) is 
a constant matrix. Also note that with full-block (“unstructured”) input uncertainty 
(Ar is a full matrix) the worst-case sensitivity is ||S"||oo = 1021.7 


Conclusions on input uncertainty and feedback control 


The following statements apply to the frequency range around crossover. By 
“small’, we mean about 2 or smaller. By “large” we mean about 10 or larger. 


1. Condition number 7(G), or y(K), small: robust to both diagonal and 
full-block input uncertainty, see (6.72) and (6.73). 

2. Minimized condition numbers y7(G) or y6(A’) small: robust to diagonal 
input uncertainty, see (6.76) and (6.77). Note that a diagonal controller 
always has y6(K) = 1. 

3. RGA(G) has large elements: inverse-based controller is not robust to 
diagonal input uncertainty, see (6.78). Since diagonal input uncertainty 
is unavoidable in practice, the rule is never to use a decoupling controller 
for a plant with large RGA-elements. Furthermore, a diagonal controller 
will most likely yield poor nominal performance for a plant with large 
RGA-elements, so we conclude that plants with large RGA-elements are 
fundamentally difficult to control. 

4. y7(G) is large while at the same time the RGA has small elements: cannot 
make any definite conclusion about the sensitivity to input uncertainty 
based on the bounds in this section. However, as found in Example 6.9 we 
may expect there to be problems. 


6.10.5 Element-by-element uncertainty 


Above we introduced the RGA as a sensitivity measure with respect to input 
gain uncertainty. In fact, the RGA is an even better sensitivity measure 
with respect to element-by-element uncertainty in the matrix. Consider 
any complex matrix G and let 4; denote the ij’th element in the RGA- 
matrix of G. The following result holds (Yu and Luyben, 1987; Hovd and 
Skogestad, 1992); Appendix A.4): 


Theorem 6.8 The (complet) matrix G becomes singular if we make a relative 
change —1/4j in its ij-th element, that is, if a single element in G is perturbed 
from giz to Gpij = gij(1— x) 
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Thus, the RGA-matrix is a direct measure of sensitivity to element-by- 
element uncertainty and matrices with large RGA-values become singular for 
small relative errors in the elements. 


Example 6.10 The matriz G in (6.83) is non-singular. The 1,2-element of the 
RGA is A12(G) = —34.1. Thus, the matriz G becomes singular if gio = —86.4 is 
perturbed to 


Jp12 = —86.4(1 — 1/(—34.1)) = —88.9 (6.88) 


The above result is an important algebraic property of the RGA, but it also 
has important implications to improve control: 

1) Identification. Models of multivariable plants, G(s), are often obtained 
by identifying one element at a time, for example, using step responses. From 
Theorem 6.8 it is clear that this simple identification procedure will most 
likely give meaningless results (eg., the wrong sign of the steady-state RGA) 
if there are large RGA- elements within the bandwidth where the model is 
intended to be used. 

2) RHP-zeros. Consider a plant with transfer function matrix G(s). If 
the relative uncertainty in an element at a given frequency is larger than 
|1/A;;(jw)| then the plant may be singular at this frequency, implying that the 
uncertainty allows for a RHP-zero on the jw-axis. This is of course detrimental 
to performance both in terms of feedforward and feedback control. 


Remark. Theorem 6.8 seems to “prove” that plants with large RGA-elements are 
fundamentally difficult to control. However, although the statement may be true (see 
the conclusions on page 260), we cannot draw this conclusion from Theorem 6.8. 
This is because the assumption of element-by-element uncertainty is often unrealistic 
from a physical point of view, since the elements are usually coupled in some way. 
For example, this is the case for the distillation column process where we know that 
the elements are coupled due to an underlying physical constraint in such a way 
that the model (6.83) never becomes singular even for large changes in the transfer 
function matrix elements. 


6.10.6 Steady-state condition for integral control 


Feedback control reduces the sensitivity to model uncertainty at frequencies 
where the loop gains are larger. Specifically, with integral action in the 
controller we can achieve zero steady-state control error, even with large model 
errors, provided the sign of the plant, as expressed by det G(0), does not 
change. The statement applies for stable plants, or more generally for cases 
where the number of unstable poles in the plant does not change, and the 
conditions are stated more exactly in the following theorem. 


Theorem 6.9 Let the number of open-loop unstable poles (excluding poles at 
s = 0) of G(s)K(s) and G'(s)K(s) be P and P’, respectively. Assume that 
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the controller K is such that GK has integral action in all channels, and that 
the transfer functions GK and G'K are strictly proper. Then if 


<0 for P — P’ even, including zero 


>0 for P —P’ odd O82) 


det G’ (0) / det G(0) { 
at least one of the following instabilities will occur: a) The negative feedback 
closed-loop system with loop gain GK is unstable. b) The negative feedback 
closed-loop system with loop gain G'K is unstable. 


Proof: For stability of both ([+GK)7~! and (I+G' K)~' we have from Lemma A.11 
in Appendix A.6.3 that det(/ + EoT(s)) needs to encircle the origin P — P’ times 
as s traverses the Nyquist D-contour. Here 7'(0) = I because of the requirement for 
integral action in all channels of GK. Also, since GK and G'K are strictly proper, 
EoT is strictly proper, and hence Eo(s)T(s) — 0 as s — oo. Thus, the map of 
det (I + EoT(s)) starts at det G'(0)/ det G(0) (for s = 0) and ends at 1 (for s = 00). 
A more careful analysis of the Nyquist plot of det(J + EoT(s)) reveals that the 
number of encirclements of the origin will be even for det G’(0)/ det G(0) > 0, and 
odd for det G’(0)/ det G(0) < 0. Thus, if this parity (odd or even) does not match 
that of P — P’ we will get instability, and the theorem follows. 


Example 6.11 Suppose the true model of a plant is given by G(s), and that we 
by careful identification obtain a model Gi(s), 


ee Ee ina Gi(s) = —2 ie Sal 

75s +1 | 108.2 -—109.6 | ’ : 75s+1|109 —108 

At first glance, the identified model seems very good, but it is actually useless 
for control purposes since detGi(0) has the wrong sign; detG(0) = —274.4 and 
det G1 (0) = 196 (also the RGA-elements have the wrong sign; the 1, 1-element in the 
RGA is —47.9 instead of +35.1). From Theorem 6.9 we then get that any controller 
with integral action designed based on the model G1, will yield instability when applied 
to the plant G. 


6.11 Summary: Input-output controllability 


We now summarize the main findings of this chapter in an analysis procedure 
for input-output controllability of a MIMO plant. The presence of directions 
in MIMO systems makes it more difficult to give a precise description of the 
procedure in terms of a set of rules as was done in the SISO case. 


6.11.1 Controllability analysis procedure 


The following procedure assumes that we have made a decision on the plant 
inputs and plant outputs (manipulations and measurements), and we want to 
analyze the model G to find out what control performance can be expected. 
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The procedure can also be used to assist in control structure design (the 
selection of inputs, outputs and control configuration), but it must then be 
repeated for each candidate set of inputs and outputs. In some cases the 
number of possibilities is so large that such an approach becomes prohibitive. 
In this case some pre-screening is required, for example, based on physical 
insight or by analyzing the “large” model, Gan, with all the candidate inputs 
and outputs included. This is briefly discussed in Section 10.4. 

A typical MIMO controllability analysis may proceed as follows: 


1. Scale all variables (inputs u, outputs y, disturbances d, references, 1) to 
obtain a scaled model, y = G(s)u+Ga(s)d, r = RF, see Section 1.4. 

2. Obtain a minimal realization. 

3. Check functional controllability. To be able to control the outputs 
independently, we first need at least as many inputs u as outputs y. Second, 
we need the rank of G(s) to be equal to the number of outputs, J, i.e. the 
minimum singular value of G(jw), a(G) = o1(G), should be nonzero (except 
at possible the jw-axis zeros). If the plant is not functional controllable then 
compute the output direction where the plant has no gain, see (6.19), to 
obtain insight into the source of the problem. 

4. Compute the poles. For RHP (unstable) poles obtain their locations and 
associated directions, see (6.5). “Fast” RHP-poles far from the origin are 
bad. 

5. Compute the zeros. For RHP-zeros obtain their locations and associated 
directions. Look for zeros pinned into certain outputs. “Small” RHP-zeros 
(close to the origin) are bad! 

6. Obtain the frequency response G(jw) and compute the RGA matrix, 
A = Gx (Gt). Plants with large RGA-elements at crossover frequencies 
are difficult to control and should be avoided. For more details about the 
use of the RGA see below. 

7. From now on scaling is critical. Compute the singular values of G(jw) and 
plot them as a function of frequency. Also consider the associated input 
and output singular vectors. 

8. The minimum singular value, o(G(jw)), is a particularly useful 
controllability measure. It should generally be as large as possible at 
frequencies where control is needed. If o(G(jw)) < 1 then we cannot at 
frequency w make independent output changes of unit magnitude by using 
inputs of unit magnitude. 

9. For disturbances and need for control consider the elements of the matrix 
Ga. At frequencies where one, or more elements, is larger than 1, we need 
control. We get more information by considering one disturbance at a time 
(the columns gq of Gq). We must require for each disturbance that S is less 
than 1/||gall2 in the disturbance direction yg, ie. ||Syall2 < 1/llgall2, see 
(6.33). Thus, at least we must require a(S) < 1/||gal|2 and we may have to 
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require a(S) < 1/||gall2; see (6.34). 


Remark. If feedforward control is already used, then one may instead analyze 
Ga(s) = GKaGma+Ga where Ka denotes the feedforward controller, see (5.81). 


10. Disturbances and input saturation: 


First step. Consider the input magnitudes needed for perfect control by 
computing the elements in the matrix GtG4. If all elements are less 
than 1 at all frequencies then input saturation is not expected to be 
a problem. If some elements of G'Gg are larger than 1, then perfect 
control (e = 0) cannot be achieved at this frequency, but “acceptable” 
control (|lel|2 < 1) may be possible, and this may be tested in the 
second step. 


Second step. Check condition (6.47), that is, consider the elements of 
U"Gq and make sure that the elements its the i’th row are smaller 
than o;(G) + 1, at all frequencies. 


11. Are the requirements compatible? Look at disturbances, RHP-poles and 
RHP-zeros and their associated locations and directions. For example, we 
must require for each disturbance and each RHP-zero that |y¥ ga(z)| < 1; 
see (6.35). For combined RHP-zeros and RHP-poles see (6.13) and (6.14). 

12. Uncertainty. If the condition number 7(G) is small then we expect no 
particular problems with uncertainty. If the RGA-elements are large then 
we expect strong sensitivity to uncertainty. For a more detailed analysis 
see the conclusion on page 260 for a more detailed summary. 

13. If decentralized control (diagonal controller) is of interest see the summary 
on page 472. 

14. The use of the condition number and RGA are summarized separately in 
Section 3.6, page 94. 


A controllability analysis may also be used to obtain initial performance 
weights for controller design. After a controller design one may analyze the 
controller by plotting, for example, its elements, singular values, RGA and 
condition number as a function of frequency. 


6.11.2 Plant design changes 


Tf a plant is not input-output controllable, then it must somehow be modified. 
Some possible modifications are listed below. 

Controlled outputs. Identify the output(s) which cannot be controlled 
satisfactorily. Can the specifications for these be relaxed? 

Manipulated inputs. If input constraints are encountered then consider 
replacing or moving actuators. For example, this could mean replacing a 
control valve with a larger one, or moving it closer to the controlled output. 
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If there are RHP-zeros which cause control problems then the zeros may 
often be eliminated by adding another input (possibly resulting in a nonsquare 
plant). This may not be possible if the zero is pinned to a particular output. 

Extra Measurements. If the effect of disturbances, or uncertainty, is 
large, and the dynamics of the plant are such that acceptable control cannot be 
achieved, then consider adding “fast local loops” based on extra measurements 
which are located close to the inputs and disturbances; see Section 10.8.2 and 
the example on page 221. 

Disturbances. If the effect of disturbances is too large, then see whether 
the disturbance itself may be reduced. This may involve adding extra 
equipment to dampen the disturbances, such as a buffer tank in a chemical 
process or a spring in a mechanical system. In other cases this may involve 
improving or changing the control of another part of the system, e.g. we may 
have that a disturbance is actually the manipulated input for another part of 
the system. 

Plant dynamics and time delays. In most cases, controllability is 
improved by making the plant dynamics faster and by reducing time delays. 
An exception to this is a strongly interactive plant, where an increased 
dynamic lag or time delay, may be helpful if it somehow “delays” the effect of 
the interactions; see (6.20). Another more obvious exception is for feedforward 
control of a measured disturbance, where a delay for the disturbance’s effect 
on the outputs is an advantage. 


Example 6.12 Consider a stable 2 x 2 plant 


= 1 
s+1 s+3 


which has a RHP-zero at s = 1 which limits achievable performance. The zero is 
not pinned to a particular output, so it will most likely disappear if we add a third 
manipulated input. Suppose the new plant is 


=~ 1 
po [°4? s+3 | 


1 2 3 


which has no zeros. It is interesting to note that all the three 2x sub-plants do have 
RHP-zeros (located at s =1, 5s =1.5 and s = 3, respectively). 


Exercise 6.7 Design multivariable controllers for the two plants Gi(s) and G2(s) 
in Example 6.12. Perform closed-loop simulations to illustrate that RHP-zeros pose 
no limitations in the latter case. You may use the HoS/KS design method, e.g. see 
page 90. 
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6.12 Additional exercises 


Some of these exercises may benefit from a reading of Chapter 10 on 
decentralized control. In all cases the variables are assumed to be scaled as 
outlined in Section 1.4. 


Exercise 6.8 Analyze inpul-oulput controllability for 


a(s)==4 [aes i 
s+100[ Ar 1 
Compute the zeros and poles, plot the RGA as a function of frequency, etc. 


Exercise 6.9 Analyze the input-output controllability for 


Gia = 1 a a 


(rs + 1)(75+1+4 2a) a Ts+1+a 


where t = 100; consider two cases: (a) a = 20, and (b) a=2. 
Remark. This is a simple “two-mizing-tank” model of a heat exchanger where 


u= Fear y= eel and @ is the number of heat transfer units. 
i out 


Exercise 6.10 Let 


—-10 O 10 1.1 0 0 
sof P sJecne=[8 BoB t 


a) Perform a controllability analysis of G(s). 

b) Let & = Ax + Bu+d and consider a unit disturbance d = [21 22)". Which 
direction (value of 21/22) gives a disturbance that is most difficult to reject (consider 
both RHP-zeros and input saturation)? 

c) Discuss decentralized control of the plant. How would you pair the variables? 


Exercise 6.11 Consider the following two plants. Do you expect any control 
problems? Could decentralized or inverse-based control be used? What pairing would 
you use for decentralized control? 


1 g—] Ss 
Gal) = T55G + 1s + 0) 42 5 | 


_ 1 1 0.1(s — 1) 
Go(s) = (s? +0.1) Los ous aA 


Exercise 6.12 Order the following three plants in terms of their expected ease of 
controllability 


100 95 100e-*  95e78 100 95e75 
SS lian rl fats) = 100 100 Ge)= 


Remember to also consider the sensitivity to input gain uncertainty. 
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Exercise 6.13 Analyze the input-output controllability for 


5000s 2(—5s+1) 
G(s) = Kpogde-+1)(2e-1) Lost 
5s+1 5s+1 


Exercise 6.14 Analyze the input-output controllability for 


100 102 = 
G(s) = Ee io ; gai(s) = | + 9d2 >= 


s4+1 


Which disturbance is the worst? 


Exercise 6.15 Analyze the input-output controllability for the following three 
plants with 2 inputs and 1 output: G(s) =(gi(s) g2(s)) 


(a) 91(s) = g2(s) = $5. 
() n(s)= a, 
(c) 9(s) = 55, 92(s) = S55- 


Design controllers and perform closed-loop simulations of reference tracking to 
complement your analysis. Consider also the input magnitudes. 


Exercise 6.16 Find the poles and zeros and analyze the input-output controllabil- 
ity for 

41 

G(s) = k i 


c+ 


(Remark. A similar model form is encountered for distillation columns controlled 
with the DB-configuration. In which case the physical reason for the model being 
singular at steady-state is that the sum of the two manipulated inputs is fired at 
steady-state, D+ B= F). 


Exercise 6.17 Controllability of an FCC process. Consider the following 
3 x 3 plant 


Yl U1 1 
a G % —a 
ue (s);u)s S03) = Geespaas $1) 
Y3 U3 
16.8(920s? + 32.48 + 1) 30.5(52.18 +1) 4,30(7.288 +1) 
G(s) = f(s) —16.7(75.58 + 1) 31.0(75.8s + 1)(1.58s+1) —1.41(74.6s + 1) 
1.27(—939s + 1) 54.1(57.38 + 1) 5.40 


It turns out that this 3x3 plant is difficult to control, but that acceptable control can 
be achieved with partial control of two outputs with input 3 in manual (not used). 
That is, we have a 2 x 2 control problem. Consider three options for the controlled 


outputs: 
Yi A . V5 El . 3 y 
y2 ¥3 Yy2 — Y3 


In all three case the inputs are u; and u2. Assume that the third input is a disturbance 
(d = us ). 
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a) Based on the zeros of the three 2 x 2 plants, Gi(s), Go(s) and G3(s), which 
choice of outputs do you prefer? Which seems to be the worst? 


It may be useful to know that the zero polynomials: 
a | 5.75- 107s? + 3.92- 107s? + 3.85 - 10s? + 1.22 - 10°s + 1.03 - 10° 
b 4.44 - 10° 5? — 1.05 - 10®s? — 8.61 - 104s — 9.43 - 10? 
5.75 -107s4 — 8.75 - 10%s? — 5.66 - 10°s? + 6.35 - 102s + 1.60 - 102 


have the following roots: 
a 0.570 0.0529 0.0451 0.0132 
b 0.303 | —0.0532 | —0.0132 
c 0.199 | —0.0532 0.0200 | —0.0132 
b) For the preferred choice of outputs in a) do a more detailed analysis of the 
expected control performance (compute poles and zeros, sketch RGAi1, comment on 


possible problems with input constraints (assume the inputs and outputs have been 
properly scaled), discuss the effect of the disturbance, etc.). What type of controller 
would you use? What pairing would you use for decentralized control? 

c) Discuss why the 3 x 3 plant may be difficult to control. 


Remark. This is actually a model of a fluid catalytic cracking (FCC) reactor 
where u = (Fs Fa ke)” represents the circulation, airflow and feed composition, 
and y = (Ti Tey Teg represents three temperatures. Gi(s) ts called Hicks control 
structure and G'3(s) the conventional structure. More details are found in Hovd and 
Skogestad (1993). 


6.13. Conclusion 


We have found that most of the insights into the performance limitations 
of SISO systems developed in Chapter 5 carry over to MIMO systems. For 
RHP-zeros, RHP-poles and disturbances, the issue of directions usually makes 
the limitations less severe for MIMO than for SISO systems. However, the 
situation is usually the opposite with model uncertainty because for MIMO 
systems there is also uncertainty associated with plant directionality. This is 
an issue which is unique to MIMO systems. 

We summarized on page 262 the main steps involved in an analysis of 
input-output controllability of MIMO plants. In particular, we discussed the 
use of the condition number and the RGA as an input-output controllability 
measures. 


if 


UNCERTAINTY AND 
ROBUSTNESS FOR SISO 
SYSTEMS 


In this chapter, we show how to represent uncertainty by real or complex 
perturbations and we analyze robust stability (RS) and robust performance (RP) 
for SISO systems using elementary methods. Chapter 8 is devoted to a more general 
treatment. 


7.1 Introduction to robustness 


A control system is robust if it is insensitive to differences between the actual 
system and the model of the system which was used to design the controller. 
These differences are referred to as model/plant mismatch or simply model 
uncertainty. The key idea in the 7. robust control paradigm we use is to 
check whether the design specifications are satisfied even for the “worst-case” 
uncertainty. 

Our approach is then as follows: 


1. Determine the uncertainty set: find a mathematical representation of the 
model uncertainty (“clarify what we know about what we don’t know”). 

2. Check Robust Stability (RS): determine whether the system remains stable 
for all plants in the uncertainty set. 

3. Check Robust Performance (RP): if RS is satisfied, determine whether the 
performance specifications are met for all plants in the uncertainty set. 


This approach may not always achieve optimal performance. In particular, 
if the worst-case plant rarely or never occurs, other approaches, such 
as optimizing some average performance or using adaptive control, may 
yield better performance. Nevertheless, the linear uncertainty descriptions 
presented in this book are very useful in many practical situations. 
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It should also be appreciated that model uncertainty is not the only 
concern when it comes to robustness. Other considerations include sensor 
and actuator failures, physical constraints, changes in control objectives, 
the opening and closing of loops, etc. Furthermore, if a control design is 
based on an optimization, then robustness problems may also be caused by 
the mathematical objective function not properly describing the real control 
problem. Also, the numerical design algorithms themselves may not be robust. 
However, when we refer to robustness in this book, we mean robustness with 
respect to model uncertainty, and assume that a fixed (linear) controller is 
used. 

To account for model uncertainty we will assume that the dynamic 
behaviour of a plant is described not by a single linear time invariant model 
but by a set II of possible linear time invariant models, sometimes denoted 
the “uncertainty set”. We adopt the following notation: 


II — a set of possible perturbed plant models. 


G(s) € TT — nominal plant model (with no uncertainty). 


G(s) € II and G’(s) € Il — particular perturbed plant models. 


Sometimes Gp is used rather than II to denote the uncertainty set, whereas 
G’ always refers to a specific uncertain plant. The subscript p stands for 
perturbed or possible or II (pick your choice). This should not be confused 
with the subscript capital P, e.g. in wp, which denotes performance. 

We will use a “norm-bounded uncertainty description” where the set II 
is generated by allowing H.. norm-bounded perturbations to the nominal 
plant G(s). We let A denote a normalized perturbation with H,, norm less 
than or equal to 1. This corresponds to a continuous description of the model 
uncertainty, and there will be an infinite number of possible plants G', in the 
set II. 

Next, we need to decide on how to quantify performance for plants Gp 
belonging to the set II. In the robust H. framework, performance is analyzed 
by considering all possible responses, and requiring that even the worst- 
case response satisfies our specification. One disadvantage with this approach 
is that when the uncertainty arises from a large number of independent 
parameters it is unlikely that the worst-case situation will occur, so the 
resulting design may be conservative. 

We will next discuss some sources of model uncertainty and outline how to 
represent these mathematically. 


UNCERTAINTY AND ROBUSTNESS 271 
7.2 Representing uncertainty 


Uncertainty in the plant model may have several origins: 


1. There are always parameters in the linear model which are only known 
approximately or are simply in error. 

2. The parameters in the linear model may vary due to nonlinearities or 
changes in the operating conditions. 

3. Measurement devices have imperfections. This may even give rise to 
uncertainty on the manipulated inputs, since the actual input is often 
measured and adjusted in a cascade manner. For example, this is often 
the case with valves used to measure flow. In other cases limited valve 
resolution may cause input uncertainty. 

4. At high frequencies even the structure and the model order is unknown, 
and the uncertainty will always exceed 100% at some frequency. 

5. Even when a very detailed model is available we may choose to work with 
a simpler (low-order) nominal model and represent the neglected dynamics 
as “uncertainty” . 

6. Finally, the controller implemented may differ from the one obtained by 
solving the synthesis problem. In which case one may include uncertainty 
to allow for controller order reduction and implementation inaccuracies. 


The various sources of model uncertainty mentioned above may be grouped 
into two main classes: 


1. Parametric uncertainty. Here the structure of the model (including the 
order) is known, but some of the parameters are unknown. 

2. Neglected and unmodelled dynamics uncertainty. Here the model 
is in error because of missing dynamics, usually at high frequencies, either 
through deliberate neglect or because of a lack of understanding of the 
physical process. Any model of a real system will contain this source of 
uncertainty. 


Parametric uncertainty will be quantified by assuming that each uncertain 
parameter is bounded within some region [Qmin,@max]. That is, we have 
parameter sets of the form 


Qp = A(1+7.A) 


where @ is the mean parameter value, rag = (Qmax — Qmin)/(Qmax + Qmin) is 
the relative uncertainty in the parameter, and A is any real scalar satisfying 
JAI <1. 

Neglected and unmodelled dynamics uncertainty is somewhat less precise 
and thus more difficult to quantify, but it appears that the frequency domain 
is particularly well suited for this class. This leads to complex perturbations 
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which we normalize such that |/A||.. <1. In this chapter, we will deal mainly 
with this class of perturbations. 

For completeness one may consider a third class of uncertainty (which is 
really a combination of the other two): 


3. Lumped uncertainty. Here the uncertainty description represents one 
or several sources of parametric and/or unmodelled dynamics uncertainty 
combined into a single lumped perturbation of a chosen structure. 


The frequency domain is also well suited for describing lumped uncertainty. In 
most cases we prefer to lump the uncertainty into a multiplicative uncertainty 
of the form 


Figure 7.1: Plant with multiplicative uncertainty 


Ilr: Gp(s) =G(s)(1+wr(s)Ar(s));  |Ar(jw)| <1 Vw (7.1) 
Avo <1 


which may be represented by the block diagram in Figure 7.1. Here Ar(s) is 
any stable transfer function which at each frequency is less than or equal to 
one in magnitude. The stability assumption on A;(s) may be removed if one 
instead assumes that the number of RHP poles in G(s) and G(s) remains 
unchanged, but to simplify the stability proofs we will in this book assume 
that the perturbations are stable. Some examples of allowable A;(s)’s with 
Hoo norm less than one, ||Ar|loo < 1, are 


H=2 1 1 0.1 
s+2? 7s4+1? (58+1)3’ 52401541 


The subscript J denotes “input”, but for SISO systems it doesn’t matter 
whether we consider the perturbation at the input or output of the plant, 
since 


G(1+wyAr) =(1+woAo)G with A7z(s) = Ao(s) and wr(s) = wo(s) 
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Another uncertainty form, which is better suited for representing pole 
uncertainty is the inverse multiplicative uncertainty 


I; F G(s) = G(s)(1 + wir(s)Air(s)) 71; |Air(jw)| < 1 Vw (7.2) 


Even with a stable A;7(s) this form allows for uncertainty in the location of 
an unstable pole, and it also allows for poles crossing between the left and 
right-half planes. 

Parametric uncertainty is sometimes called structured uncertainty as 
it models the uncertainty in a structured manner. Analogously, lumped 
dynamics uncertainty is sometimes called unstructured uncertainty. However, 
one should be careful about using these terms because there can be several 
levels of structure, especially for MIMO systems. 


Remark. Alternative approaches for describing uncertainty and the resulting 
performance may be considered. One approach for parametric uncertainty is to 
assume a probabilistic (e.g. normal) distribution of the parameters, and to consider 
the “average” response. This stochastic uncertainty is, however, difficult to analyze 
exactly. 

Another approach is the multi-model approach in which one considers a finite set 
of alternative models. This approach is well suited for parametric uncertainty as it 
eases the burden of the engineer in representing the uncertainty. Performance may 
be measured in terms of the worst-case or some average of these models’ responses. 
The multi-model approach can also be used when there is unmodelled dynamics 
uncertainty. A problem with the multi-model approach is that it is not clear how to 
pick the set of models such that they represent the limiting (“worst-case”) plants. 

A third approach is differential sensitivity (or local robustness). 


To summarize, there are many ways to define uncertainty, from differential 
sensitivity (local robustness), to stochastic uncertainty and multi-models. 
Weinmann (1991) gives a good overview. In particular, there are several ways 
to handle parametric uncertainty, and of these the Ho» frequency-domain 
approach, used in this book, may not be the best or the simplest, but it can 
handle most situations as we will see! In addition, the frequency-domain is 
excellent for describing neglected or unknown dynamics, and it is very well 
suited when it comes to making simple yet realistic uncertainty descriptions. 


7.3. Parametric uncertainty 


In spite of what is sometimes claimed, parametric uncertainty may also be 
represented in the 1. framework, at least if we restrict the perturbations A 
to be real. This is discussed in more detail in Section 7.7. Here we provide 
just two simple examples. 
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Example 7.1 Gain uncertainty. Let the set of possible plants be 
Gp(8) = kpGo(s);  kmin < kp < kmax (7.3) 


where kp is an uncertain gain and Go(s) is a transfer function with no uncertainty. 
By writing 


= A kmax — kmin > A kmin + kmax 
SNE I ange gt oa 


(7.4) 


where r;, is the relative magnitude of the gain uncertainty and k is the average gain, 
(7.8) may be rewritten as multiplicative uncertainty 


_ 
G(s) 


G,(s) = BGo(s\(1+ re), |Al <1 (7.5) 
_—~ 


where A is areal scalar and G(s) is the nominal plant. We see that the uncertainty 
in (7.5) is in the form of (7.1) with a constant multiplicative weight w1(s) = rr- 
The uncertainty description in (7.5) can also handle cases where the gain changes 
sign (kmin <0 and kmax > 0) corresponding to rp > 1. The usefulness of this is 
rather limited, however, since it is impossible to get any benefit from control for a 
plant where we can have Gp = 0, at least with a linear controller. 


Example 7.2 Time constant uncertainty. Consider a set of plants, with an 
uncertain time constant, given by 


1 
ae a Tmin S Tp < Tmax (7.6) 


Gp(s) = 


By writing t = 7(1+17-,A), similar to (7.4) with |A| <1, the model set (7.6) can 
be rewritten as 


_ Go Ge 1 ; a FETS 
Gol!) = aes epee Dees DE wir(s)A’ US Tes C0) 
a eee 
G(s) 


which is in the inverse multiplicative form of (7.2). Note that it does not make 
physical sense for Tp to change sign, because a value Tp =0~ corresponds to a pole 
at infinity in the RHP, and the corresponding plant would be impossible to stabilize. 
To represent cases in which a pole may cross between the half planes, one should 
instead consider parametric uncertainty in the pole itself, 1/(s +p), as described in 
(7.86). 


As shown by the above examples one can represent parametric uncertainty 
in the H. framework. However, parametric uncertainty is often avoided for 


the following reasons: 


1. It usually requires a large effort to model parametric uncertainty. 
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2. A parametric uncertainty model is somewhat deceiving in the sense that 
it provides a very detailed and accurate description, even though the 
underlying assumptions about the model and the parameters may be much 
less exact. 

3. The exact model structure is required and so unmodelled dynamics cannot 
be dealt with. 

4. Real perturbations are required, which are more difficult to deal with 
mathematically and numerically, especially when it comes to controller 
synthesis. 


Instead, it is more usual to represent parametric uncertainty by complex 
perturbations. For example, simply replace the real perturbation, —1 < A <1 
by a complex perturbation with |A(jw)| < 1. This is of course conservative as 
it introduces possible plants that are not present in the original set. However, 
if there are multiple real perturbations, then the conservatism may be reduced 
by lumping these perturbations into a single complex perturbation (at least 
for the SISO case). Typically, a complex multiplicative perturbation is used, 
e.g. Gp =G(I+wyA). 

How is it possible that we can reduce conservatism by lumping together 
several real perturbations? This will become clearer from the examples in the 
next section, but simply stated the answer is that the complicated uncertainty 
region may become more “disk-shaped”, and may then be more accurately 
represented by a single complex perturbation. 


7.4 Representing uncertainty in the frequency 
domain 


In terms of quantifying unmodelled dynamics uncertainty the frequency- 
domain approach (H..) does not seem to have much competition (when 
compared with other norms). In fact, Owen and Zames (1992) make the 
following observation: “The design of feedback controllers in the presence 
of non-parametric and unstructured uncertainty ... is the reason d’étre for 
Hoo feedback optimization, for if disturbances and plant models are clearly 
parameterized then H.. methods seem to offer no clear advantages over more 
conventional state-space and parametric methods.” 


7.4.1 Uncertainty regions 


To illustrate how parametric uncertainty translates into frequency domain 
uncertainty, consider in Figure 7.2 the Nyquist plots (or regions) generated 
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by the following set of plants 


G,(s) = 


Step 1. At each frequency, a region of complex numbers G'p(jw) is generated 
by varying the three parameters in the ranges given by (7.8). In 
general, these uncertainty regions have complicated shapes and complex 
mathematical descriptions, and are cumbersome to deal with in the 
context of control system design. 


A e, 2<k,0,7<3 (7.8) 
TS+1 


Step 2. We therefore approximate such complex regions as discs (circles) 
as shown in Figure 7.3, resulting in a (complex) additive uncertainty 
description as discussed next. 


Figure 7.2: Uncertainty regions of the Nyquist plot at given frequencies. Data from 
(7.8). 


Remark 1 There is no conservatism introduced in the first step when we go 
from a parametric uncertainty description as in (7.8) to an uncertainty region 
description as in Figure 7.2. This is somewhat surprising since the uncertainty 
regions in Figure 7.2 seem to allow for more uncertainty. For example, they allow 
for “jumps” in G,(jw) from one frequency to the next (e.g. from one corner of a 
region to another). Nevertheless, we derive in this and the next chapter necessary and 
sufficient frequency-by-frequency conditions for robust stability based on uncertainty 
regions. Thus, the only conservatism is in the second step where we approximate the 
original uncertainty region by a larger disc-shaped region as shown in Figure 7.3. 


Remark 2 Exact methods do exist using region mapping techniques iwhich avoid 
the second conservative step. However, as already mentioned these methods are 
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-1 


Figure 7.3: Disc approximation (solid line) of the original uncertainty region 
(dashed line). Plot corresponds to w = 0.2 in Figure 7.2. 


rather complex, and although they may be used in analysis, at least for simple 
systems, they are not really suitable for controller synthesis and will not be pursued 
further in this book. 


7.4.2 Representing uncertainty regions by complex 
perturbations 


We will use disc-shaped regions to represent uncertainty regions as illustrated 
by the Nyquist plots in Figures 7.3 and 7.4. These disc-shaped regions may 
be generated by additive complex norm-bounded perturbations (additive 
uncertainty) around a nominal plant G 


IIl4: G(s) = G(s) +wa(s)Aa(s); |Aa(jw)| < 1Vu (7.9) 


where A4(s) is any stable transfer function which at each frequency is less 
than one in magnitude. How is this possible? If we consider all possible A.4’s, 
then at each frequency A,4(jw) “generates” a disc-shaped region with radius 
1 centred at 0, so G(jw) + w4(jw)A4(jw) generates at each frequency a disc- 
shaped region of radius |w4(jw)| centred at G(jw) as shown in Figure 7.4. 

In most cases w4(s) is a rational transfer function (although this need not 
always be the case). 

One may also view w,(s) as a weight which is introduced in order to 
normalize the perturbation to be less than 1 in magnitude at each frequency. 
Thus only the magnitude of the weight matters, and in order to avoid 
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Figure 7.4: Disc-shaped uncertainty regions generated by complex additive 
uncertainty, Gp =G+wad 


unnecessary problems we always choose w,(s) to be stable and minimum 
phase (this applies to all weights used in this book). 

The disk-shaped regions may alternatively be represented by a multiplicative 
uncertainty description as in (7.1), 


Ilr: Gp(s) =G(s)(1+wr(s)Ar(s));  |Ar(jw)| <1, Vw (7.10) 


By comparing (7.9) and (7.10) we see that for SISO systems the additive and 
multiplicative uncertainty descriptions are equivalent if at each frequency 


Jw (jw) | = wa GGw)|/|GGe)| (7.11) 


However, multiplicative (relative) weights are often preferred because their 
numerical value is more informative. At frequencies where |wy(jw)| > 1 the 
uncertainty exceeds 100% and the Nyquist curve may pass through the origin. 
This follows since, as illustrated in Figure 7.5, the radius of the discs in 
the Nyquist plot, |w4(jw)| = |G(jw)wy(jw)|, then exceeds the distance from 
G(jw) to the origin. At these frequencies we do not know the phase of the 
plant, and we allow for zeros crossing from the left to the right-half plane. 
To see this, consider a frequency wo where |w7(jwo)| > 1. Then there exists a 
|A7| < 1 such that G,(jwo) = 0 in (7.10), that is, there exists a possible plant 
with zeros at s = +jw. For this plant at frequency wo the input has no effect 
on the output, so control has no effect. It then follows that tight control is 
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Figure 7.5: The set of possible plants includes the origin at frequencies where 
|wa(jw)] > |G(jw)|, or equivalently |wr (jw)| > 1. 


not possible at frequencies where |w7(jw)| > 1 (this condition is derived more 
exactly in (7.32). 


7.4.3 Obtaining the weight for complex uncertainty 


Consider a set II of possible plants resulting, for example, from parametric 
uncertainty as in (7.8). We now want to describe this set of plants by a 
single (lumped) complex perturbation, A, or A;. This complex (disk-shaped) 
uncertainty description may be generated as follows: 


1. Select a nominal model G(s). 
2. Additive uncertainty. At each frequency find the smallest radius 14 (w) 
which includes all the possible plants IT: 


Law) = max |Gp(jw) — G(ju)| (7.12) 


If we want a rational transfer function weight, w4(s) (which may not be 
the case if we only want to do analysis), then it must be chosen to cover 
the set, so 


lwa(jw)| > law) Vo (7.13) 
Usually w_4(s) is of low order to simplify the controller design. Furthermore, 


an objective of frequency-domain uncertainty is usually to represent 
uncertainty in a simple straightforward manner. 
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3. Multiplicative (relative) uncertainty. This is often the preferred uncertainty 
form, and we have 


Hy =e | S02) (7.14) 


and with a rational weight 
jw (jee)| > Uw), Wo (7.15) 
Example 7.3 Multiplicative weight for parametric uncertainty. Consider 
again the set of plants with parametric uncertainty given in (7.8) 


k —6s 


Il: Gp) gaat , 2<k, 6,7 <3 (7.16) 


We want to represent this set using multiplicative uncertainty with a rational weight 
wr(s). To simplify subsequent controller design we select a delay-free nominal model 


k 2.5 
7Ts+1 2.58+1 
To obtain l;(w) in (7.14) we consider three values (2, 2.5 and 3) for each of the three 
parameters (k,0,7). (This is not in general guaranteed to yield the worst case as the 
worst case may be at the interior of the intervals.) The corresponding relative errors 


|(Gp — G)/G| are shown as functions of frequency for the 3° = 27 resulting Gp’s in 
Figure 7.6. The curve for lr(w) must at each frequency lie above all the dotted lines, 


G(s) = 


(7.17) 


is 


Magnitude 


10 10 10° 10! 
Frequency 


Figure 7.6: Relative errors for 27 combinations of k,7 and 6 with delay-free nominal 
plant (dotted lines). Solid line: First-order weight |w7i| in (7.18). Dashed line: Third- 
order weight |w7| in (7.19). 


and we find that lr(w) is 0.2 at low frequencies and 2.5 at high frequencies. To derive 
w1(s) we first try a simple first-order weight that matches this limiting behaviour: 
Ts+0.2 


wri(s) = (T72.5)s 41° T=4 (7.18) 
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As seen from the solid line in Figure 7.6 this weight gives a good fit of lr(w), except 
around w = 1 where |wri(jw)| is slightly too small, and so this weight does not include 
all possible plants. To change this so that |wr(jw)| > lr(w) at all frequencies, we can 
multiply wr by a correction factor to lift the gain slightly at w =1. The following 
works well ae , 
8 6s 

wits) =G2(2) s274+14s+1 
as is seen from the dashed line in Figure 7.6. The magnitude of the weight crosses 
1 at about w = 0.26. This seems reasonable since we have neglected the delay in 
our nominal model, which by itself yields 100% uncertainty at a frequency of about 
1/Omax = 0.33 (see Figure 7.8 (a) below). 


(7.19) 


An uncertainty description for the same parametric uncertainty, but with a 
mean-value nominal model (with delay), is given in Exercise 7.9. Parametric 
gain and delay uncertainty (without time constant uncertainty) is discussed 
further in Example 7.5. 


Remark. Pole uncertainty. In the example we represented pole (time constant) 
uncertainty by a multiplicative perturbation, A;. We may even do this for unstable 
plants, provided the poles do not shift between the half planes and one allows A;(s) 
to be unstable. However, if the pole uncertainty is large, and in particular if poles 
can cross form the LHP to the RHP, then one should use an inverse ( “feedback” ) 
uncertainty representation as in (7.2). 
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Figure 7.7: Nyquist plot of G,(jw) at frequency w = 0.5 (dashed region) showing 
complex disc approximations using three options for the nominal model: 

1. Simplified nominal model with no time delay. 

2. Mean parameter values. 

3. Nominal model corresponding to the smallest radius. 
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7.4.4 Choice of nominal model 


For the case with parametric uncertainty, three options for the choice of 
nominal model are 


1. Use a simplified nominal model, e.g. a low-order or delay-free model. 


2. Use the mean parameter values, G(s) = G(s) 
3. Use the central plant in the Nyquist plot (yielding the smallest disc). 


Example 7.4 Consider again the uncertainty set (7.16) used in Example 7.8. 
The Nyquist plot of the three resulting discs at frequency w = 0.5 are shown 
in Figure 7.7. Option 1, with the nominal model 


k 


Ge) Ss +1 


yields the largest uncertainty region, but the nominal model is simpler and this 
facilitates controller design in later stages. Option 2, with the nominal model 
kas 


Gala)= Fea 


is probably the most straightforward choice. Option 3 yields the smallest set, 
but in this case it requires a significant effort to obtain the nominal model; tt 
is not even a rational transfer function and an approximation could yield a 
very high order nominal model. 


A similar example was studied by Wang, Lundstrém and Skogestad 
(1994), who obtained the best controller designs with option 1, although the 
uncertainty region is clearly much larger in this case. The reason for this is 
that the “worst-case region” in the Nyquist plot in Figure 7.7 corresponds 
quite closely to those plants with the most negative phase (at coordinates 
about (—1.5, —1.5)). Thus, the additional plants included in the largest region 
(option 1) are generally easier to control and do not really matter when 
evaluating the worst-case plant with respect to stability or performance. In 
conclusion, at least for SISO plants, we find that for plants with an uncertain 
time delay, it is simplest and sometimes best (!) to use a delay-free nominal 
model, and representing the nominal delay as additional uncertainty. 


Remark. The choice of nominal is only an issue since we are lumping several sources 
of parametric uncertainty into a single multiplicative perturbation. Of course, if we 
use a parametric uncertainty description, based on multiple real perturbations, then 
we should always use the mean parameter values in the nominal model. 


7.4.5 Neglected dynamics represented as uncertainty 


We saw above that one advantage of frequency domain uncertainty 
descriptions is that one can choose to work with a simple nominal model, 


UNCERTAINTY AND ROBUSTNESS 283 


and represent neglected dynamics as uncertainty. We will now consider this 
in a little more detail. Consider a set of plants 


Gp(s) = Go(s) f(s) 


where Go(s) is fixed (and certain). We want to neglect the term f(s) 
(which may be fixed or may be an uncertain set IIy), and represent G', by 
multiplicative uncertainty with a nominal model G = Go. From (7.14) we 
get that the magnitude of the relative uncertainty caused by neglecting the 
dynamics in f(s) is 


Ir(w) = Hees 


= ma jw) —1 7.20 
max, [fliie) — 1 (7.20) 


G 


GS 


Two simple examples illustrate the procedure. 

1. Neglected delay. Let f(s) = e~°% where 0 < 6 < Omax. We want 
to represent Gy» = Go(s)e—°* by a delay-free plant Go(s) and multiplicative 
uncertainty. Let us first consider the maximum delay, for which the relative 
error |1 — e~/#%max| is shown as a function of frequency in Figure 7.8(a). The 
relative uncertainty crosses 1 in magnitude at about frequency 1/Omax, reaches 
2 at frequency 7/6 (since at this frequency e”? = —1), and oscillates between 
0 and 2 at higher frequencies (which corresponds to the Nyquist plot of e~J“° 
going around and around the unit circle). Similar curves are generated for 
smaller values of the delay, and they also oscillate between 0 and 2 but at 
higher frequencies. It then follows that if we consider all 0 € [0, Omax] then the 
relative error bound is 2 at frequencies above 7/@max, and we have 


ffl en GeO max| wy < 1 /Omax 
1 (w) = { : ea ie (7.21) 


Rational approximations of (7.21) are given in (7.25) and (7.26) with r, =0. 

2. Neglected lag. Let f(s) =1/(rs +1) where 0 < r < Tmax. In this case 
the resulting 17(w), which is shown in Figure 7.8 (b), can be represented by a 
rational transfer function with |w7(jw)| = 1;(w) where 


1 Tmax § 


TmaxS + 1 - TmaxS + 1 


wr(s) =1 


This weight approaches 1 at high frequency, and the low-frequency asymptote 
crosses 1 at frequency 1/7max- 


Example 7.5 Multiplicative weight for gain and delay uncertainty. 
Consider the following set of plants 


Gip(s) = kpe *Go(s); kp € [Kmin, kmax], 4p € [Omin, Omax] (7.22) 
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Figure 7.8: Multiplicative uncertainty resulting from neglected dynamics 


which we want to represent by multiplicative uncertainty and a delay-free nominal 
model, G(s) = kGo(s). Lundsirém (1994) derived the following exact expression for 
the relative uncertainty weight 


= rp + 21+ re)(1—cos (Omaxw))  forw <7/Omax 
Eey= { A : forw > 7/0max (eH) 


where rz, is the relative uncertainty in the gain. This bound is irrational. To derive a 
rational weight we first approximate the delay by a first-order Pad’e approximation 
to get 


fe fo 1—%maxg _  _— (14 7E) Omaxs +r 
Kmaxe "=" — k & k(1 + re) 72 ee oe” ( 7 *) * (7.24) 
1+ 43s ays +1 


Since only the magnitude matters this may be represented by the following first-order 
weight 
(1+ )Omaxs + 1% 


Omax 
s=st+1 


wr(s) = (7.25) 
However, as seen from Figure 7.9 by comparing the dotted line (representing wr) 
with the solid line (representing Ir), this weight wr is somewhat optimistic (too 
small), especially around frequencies 1/O0max. To make sure that |wr(jw)| > Ir(w) at 
all frequencies we apply a correction factor and get a third-order weight 


Tr 2 
mia (1+ B)Omaxs-+rx (suas) 8? +2-0.838 - Saaks +1 oo 
: Bmax 5 $1 (Saag) 9? + 2-0.685 Sages 41 ; 
2 2.363 " 2.363 


The improved weight wr(s) in (7.26) is not shown in Figure 7.9, but it would be 
almost indistingutshable from the exact bound given by the solid curve. In practical 
applications, it is suggested that one starts with a simple weight as in (7.25), and if 
it later appears important to eke out a little extra performance then one can try a 
higher-order weight as in (7.26). 
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Figure 7.9: Multiplicative weight for gain and delay uncertainty 


7.4.6 Unmodelled dynamics uncertainty 


Although we have spent a considerable amount of time on modelling 
uncertainty and deriving weights, we have not yet addressed the most 
important reason for using frequency domain (H,.) uncertainty descriptions 
and complex perturbations, namely the incorporation of unmodelled 
dynamics. Of course, unmodelled dynamics is close to neglected dynamics 
which we have just discussed, but it is not quite the same. In unmodelled 
dynamics we also include unknown dynamics of unknown or even infinite 
order. To represent unmodelled dynamics we usually use a_ simple 
multiplicative weight of the form 
TS+19 

wr(s) = Pires (7.27) 
where 79 is the relative uncertainty at steady-state, 1/7 is (approximately) 
the frequency at which the relative uncertainty reaches 100%, and r.. is 
the magnitude of the weight at high frequency (typically, r. > 2). Based 
on the above examples and discussions it is hoped that the reader has 
now accumulated the necessary insight to select reasonable values for the 
parameters 19,%>. and 7 for a specific application. The following exercise 
provides further support and gives a good review of the main ideas. 


Exercise 7.1 Suppose that the nominal model of a plant is 


1 
G(s) = —— 
Cag 
and the uncertainty in the model is parameterized by multiplicative uncertainty with 
the weight 
0.125s + 0.25 


w1(8) = @ 795 74)s 41 
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Call the resulting set II. Now find the extreme parameter values in each of the plants 
(a)-(g) below so that each plant belongs to the set II. All parameters are assumed 
to be positive. One approach is to plot Ir(w) = |G~'G' —1| in (7.14) for each G! 
(Ga,Go, etc.) and adjust the parameter in question until ly just touches |wr(jw)|. 

(a) Neglected delay: Find the largest 0 for Ga = Ge~®® (Answer: 0.13). 

(b) Neglected lag: Find the largest + for Gy = Goa (Answer: 0.15). 

(c) Uncertain pole: Find the range of a for Ge = + (Answer: 0.8 to 1.33). 


sta 
(d) Uncertain pole (time constant form): Find the range of T for Ga = a 
(Answer: 0.7 to 1.5). 
(e) Neglected resonance: Find the range of ¢ for Ge = G Say hiectiana 


(Answer: 0.02 to 0.8). 
(f) Neglected dynamics: Find the largest integer m for Gf = G (soma) 
(Answer: 13). 

(9) Neglected RHP-zero: Find the largest tr, for Gy = Gast (Answer: 0.07). 

These results imply that a control system which meets given stability and 
performance requirements for all plants in II, is also guaranteed to satisfy the same 
requirements for the above plants Ga, Go,...,Gg 

(h) Repeat the above with a new nominal plant G = 1/(s—1) (and with everything 
else the same except Gg =1/(T's —1)). (Answer: Same as above). 

(i) Repeat the above with a new weight 


s+0.3 


wr(8) = Tye 40 


We end this section with a couple of remarks on uncertainty modelling: 


1. We can usually get away with just one source of complex uncertainty for 
SISO systems. 

2. With an H. uncertainty description, it is possible to represent time delays 
(corresponding to an infinite-dimensional plant) and unmodelled dynamics 
of infinite order , even though the nominal model and associated weights 
have finite order. 


7.5 SISO Robust Stability 


We have so far discussed how to represent the uncertainty mathematically. In 
this section, we derive conditions which will ensure the system remains stable 
for all perturbations in the uncertainty set, and then in the subsequent section 
we study robust performance. 


7.5.1 RS with multiplicative uncertainty 


We want to determine the stability of the uncertain feedback system in 
Figure 7.10 when there is multiplicative (relative) uncertainty of magnitude 
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Figure 7.10: Feedback system with multiplicative uncertainty 


|wr(jw)|. With uncertainty the loop transfer function becomes 
L, =G,K = GK(1+ w7Ayz) =L+wy,LAy, |Ar(jw)| < 1, Vw (7.28) 


As always, we assume (by design) stability of the nominal closed-loop system 
(i.e. with A; = 0). For simplicity, we also assume that the loop transfer 
function L, is stable. We now use the Nyquist stability condition to test for 
robust stability of the closed-loop system. We have 


def 


RS © _ System stable VLp 


<> Ly should not encircle the point —1, VL, (7.29) 


41m 


Figure 7.11: Nyquist plot of L, for robust stability 
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1. Graphical derivation of RS-condition. Consider the Nyquist plot of 
L, as shown in Figure 7.11. Convince yourself that |— 1— | =|1+ L| is the 
distance from the point —1 to the centre of the disc representing Ly, and that 
|w7L| is the radius of the disc. Encirclements are avoided if none of the discs 
cover —1, and we get from Figure 7.11 


RS © |wrL|<|1+L|, Vw (7.30) 
wr 

° F un <1,Ve  & $ |wrT| <1, Vw (7.31) 

J |lwrT loo <1 (7.32) 


Note that for SISO systems wy = wo and T = T; = GK(1+ GK)~1, so the 
condition could equivalently be written in terms of wT; or woT. Thus, the 
requirement of robust stability for the case with multiplicative uncertainty 
gives an upper bound on the complementary sensitivity: 


RS © |T| <1/|wr|, Vw (7.33) 


We see that we have to detune the system (i.e., make T small) at frequencies 
where the relative uncertainty |w7| exceeds 1 in magnitude. Condition (7.33) 
is exact (necessary and sufficient) provided there exist uncertain plants such 
that at each frequency all perturbations satisfying |A(jw)| < 1 are possible. 
If this is not the case, then (7.33) is only sufficient for RS. For example, this 
is the case if the perturbation is restricted to be real, as for the parametric 
gain uncertainty in (7.5). 


Example 7.6 Consider the following nominal plant and PI-controller 


3(—28 +1) 12.75 +1 
(5s + 1)(10s +1) 12.75 


Recall that this is the inverse response process from Chapter 2. Initially, we select 
Ke = Kei = 1.13 as suggested by the Ziegler-Nichols tuning rule. It results in a 
nominally stable closed-loop system. Suppose that one “extreme” uncertain plant is 
G'(s) = 4(—3s+1)/(4s+1)?. For this plant the relative error |(G' —G)/G| is 0.33 at 
low frequencies; it is 1 at about frequency 0.1 rad/s, and it is 5.25 at high frequencies. 
Based on this and (7.27) we choose the following uncertainty weight 


10s + 0.33 
(10/5.25)s +1 


which closely matches this relative error. We now want to evaluate whether the 
system remains stable for all possible plants as given by Gp = G(1 + wrAr) where 
Ar(s) is any perturbation satisfying ||Allo < 1. This is not the case as seen 
from Figure 7.12 where we see that the magnitude of the nominal complementary 
sensitivity function T, = GKki/(1+GK1) exceeds the bound 1/|w1| from about 0.1 
to 1 rad/s, so (7.83) is not satisfied. To achieve robust stability we need to reduce 
the controller gain. By trial and error we find that reducing the gain to K.2 = 0.31 
just achieves RS, as is seen from the curve for Tz = GK2/(1+GK2) in Figure 7.12. 


G(s) = K(s) = Ke 


wr(s) = 
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Figure 7.12: Checking robust stability with multiplicative uncertainty. 


Remark. For the “extreme” plant G'(s) we find as expected that the closed-loop 
system is unstable with Ke. = 1.13. However, with Keo = 0.31 the system is 
stable with reasonable margins (and not on the limit of instability as one might have 
expected); we can increase the gain by almost a factor of two to K. = 0.58 before 
we get instability. This illustrates that condition (7.83) is only a sufficient condition 
for stability, and a violation of this bound does not imply instability for a specific 
plant G’. However, with K-2 =0.31 there exists an allowed Ar and a corresponding 
Gp = G(1+ wrAr) which yields To) = Ts on the limit of instability. 

2. Algebraic derivation of RS-condition. Since L, is assumed stable, 
and the nominal closed-loop is stable, the nominal loop transfer function L(jw) 
does not encircle —1. Therefore, since the set of plants is norm-bounded, it 
then follows that if some Lp; in the uncertainty set encircles —1, then there 
must be another Lye in the uncertainty set which goes exactly through —1 at 
some frequency. Thus, 


RS © |1+L,|40, VL), (7.34) 
& |L+L,|>0, VLy,Vw (7.35) 
° [1 + L+w,LA]| > 0, ViAr| < 1,Vw (7.36) 


At each frequency the last condition is most easily violated (the worst case) 
when the complex number Ar(jw) is selected with |Ar(jw)| = 1 and with 
phase such that the terms (1+ L) and w;LAyz have opposite signs (point in 
the opposite direction). Thus 


RS @|14+L|—|wL|>0, Ww @lwT|) <1, Ww (7.37) 
and we have rederived (7.32). 


Remark. Unstable plants. The stability conditions (7.32) and (7.37) also apply 
to the case when L and L, are unstable as long as the number of RHP-poles remains 
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the same for each plant in the uncertainty set. This follows since the nominal closed- 
loop system is assumed stable, so we must make sure that the perturbation does not 
change the number of encirclements, and (7.32) is the condition which guarantees 
this. 


UA YA 


Figure 7.13: M — A structure 


3. MA-structure derivation of RS-condition. This derivation is a 
preview of a general analysis presented in the next chapter. The reader should 
not be too concerned if he or she does not fully understand the details at this 
point. The derivation is based on applying the Nyquist stability condition to 
an alternative “loop transfer function” MA rather than Ly. The argument 
goes as follows. If the nominal (A; = 0) feedback system is stable then the 
stability of the system in Figure 7.10 is equivalent to stability of the system 
in Figure 7.13, where A = A; and 


M =wK(14+GK)'G = wT (7.38) 


is the transfer function from the output of A; to the input of A;. Notice 
that the only source of instability in Figure 7.10 is the new feedback loop 
created by A;. We now apply the Nyquist stability condition to Figure 7.13. 
We assume that A and M = w7T are stable; the former implies that G and 
G, must have the same unstable poles, the latter is equivalent to assuming 
nominal stability of the closed-loop system. The Nyquist stability condition 
then determines RS if and only if the “loop transfer function” MA does not 
encircle —1 for all A. Thus, 


RS © |1+MA/>0, Wuw,V/|A] <1 (7.39) 


The last condition is most easily violated (the worst case) when A is selected 
at each frequency such that |A] = 1 and the terms MA and 1 have opposite 
signs (point in the opposite direction). We therefore get 


RS cos 1-—|M(jw)|>0, Vw (7.40) 
co |[M(jw)|<1, Vw (7.41) 
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which is the same as (7.32) and (7.37) since M = wrT. The MA-structure 
provides a very general way of handling robust stability, and we will discuss 
this at length in the next chapter where we will see that (7.41) is essentially 
a clever application of the small gain theorem where we avoid the usual 
conservatism since any phase in MA is allowed. 


7.5.2 Comparison with gain margin 


We want to compare the above robustness results with the classical concept 
of gain margin. To this effect, consider a given plant Go and given controller 
K, we want to evaluate the factor, kmax, by which we can multiply the loop 
gain [Lo = GoK before we get instability. This problem may be formulated as: 
given 

Ly = kyLo; kp € (1, kmax]| (7.42) 
find the largest value of kmax such that the closed-loop system is stable. 


1. Real perturbation. The exact value of kma,, which is obtained with A 
real, is the gain margin (GM) from classical control. We have (recall (2.34)) 


1 
kmax1 = GM = ——— 7.43 
? |Lo(jw180) | oe 
where Ww g9 is the frequency where /Lp = —180°. 


2. Complex perturbation. Alternatively, represent the gain uncertainty 
as complex multiplicative uncertainty. Ly = kpLo = kLo(1+7,A) where 


kmax +1 kmax — 1 
2 me rUe kmax +1 Cs ) 


k = 
Note that the nominal L = kLo is not fixed, but depends on kmax. The 
robust stability condition ||w7T'||o. < 1 (which is derived for complex A) with 
wr = rz then gives 
kLo 


am 1 7.45 
IPTC p pales em) 


foe) 


Here both r;, and k depend on kmax, and (7.45) must be solved iteratively to 
find kmax2. Condition (7.45) would be exact if A were complex, but since it 
is not we expect kmax2 to be somewhat smaller than GM. 

Example 7.7 To check this numerically consider a system with Lo = Meee We 
find wigo = 2 [rad/s] and |Lo(jwiso)| = 0.5, and the exact factor by which we can 
increase the loop gain is, from (7.43), kmax1 = GM = 2. On the other hand, use of 
(7.45) yields kmax2 = 1.78, which as expected is less than GM=2. This illustrates 
the conservatism involved in replacing a real perturbation by a complex one. 
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Exercise 7.2 Represent the gain uncertainty in (7.42) as multiplicative complez 
uncertainty with nominal model G = Go (rather than G = kGo used above). (a) Find 
wr and use the RS-condition ||wrT||oo <1 to find kmax3. Note that no iteration is 
needed in this case since the nominal model and thus T = To is independent of 
kmax- (b) One expects kmax3 to be even more conservative than kmax2 since this 
uncertainty description is not even tight when A is real. Show that this is indeed the 
case using the numerical values from Example 7.7. 


7.5.3 RS with inverse multiplicative uncertainty 


Figure 7.14: Feedback system with inverse multiplicative uncertainty 


We will derive a corresponding RS-condition for a feedback system with 
inverse multiplicative uncertainty (see Figure 7.14) in which 


Gp = Gi + Wir (s)Ajr)~! (7.46) 


Algebraic derivation. Assume for simplicity that the loop transfer function 
Ly is stable, and assume stability of the nominal closed-loop system. Robust 
stability is then guaranteed if encirclements by L,(jw) of the point —1 are 
avoided, and since L, is in a norm-bounded set we have 


RS © |i+ L,| >0, VLp,Vw (7.47) 
cos jl+ L(+ wir Air) | > 0, V|Aaz| <1,Vw (7.48) 
cos [1 + wir Air + L| > 0, V|Aaz| <1,Vw (7.49) 


The last condition is most easily violated (the worst case) when Aj; is selected 
at each frequency such that |Aj7| = 1 and the terms 1+ LD and w;; Air have 
opposite signs (point in the opposite direction). Thus 


RS os [1 af Ll = |warl >0. Vw (7.50) 
cod |wir S| <i, Vu (7.51) 
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In this derivation we have assumed that Ly is stable, but this is not necessary 
as one may show by deriving the condition using the M A-structure. Actually, 
the RS-condition (7.51) applies even when the number of RHP-poles of Gp 
can change. 

Control implications. From (7.51) we find that the requirement of robust 
stability for the case with inverse multiplicative uncertainty gives an upper 
bound on the sensitivity, 


IS] <1/|wir|, Vw (7.52) 


We see that we need tight control and have to make S small at frequencies 
where the uncertainty is large and |w;r| exceeds 1 in magnitude. This may be 
somewhat surprising since we intuitively expect to have to detune the system 
(and make S = 1) when we have uncertainty, while this condition tells us to do 
the opposite. The reason is that this uncertainty represents pole uncertainty, 
and at frequencies where |w;r| exceeds 1 we allow for poles crossing from the 
left to the right-half plane (G, becoming unstable), and we then know that 
we need feedback (|.5| < 1) in order to stabilize the system. 

However, |S| < 1 may not always be possible. In particular, assume that 
the plant has a RHP-zero at s = z. Then we have the interpolation constraint 
S(z) = 1 and we must as a prerequisite for RS, ||wirS|lo < 1, require 
that wir(z) < 1 (recall the maximum modulus theorem, see (5.15)). Thus, 
we cannot have large pole uncertainty with |wir(jw)| > 1 (and hence the 
possibility of instability) at frequencies where the plant has a RHP-zero. This 
is consistent with the results we obtained in Section 5.9. 


Exercise 7.3 Represent the gain uncertainty in (7.42) by inverse gain uncertainty 
as in (7.84). Find the largest value of kmax which satisfies the RS-condition 
||wirS|loo < 1 (solve iteratively). Compare with kmax2 found in Example 7.7. 


7.6 SISO Robust Performance 


7.6.1 SISO nominal performance in the Nyquist plot 


Consider performance in terms of the weighted sensitivity function as 
discussed in Section 2.7.2. The condition for nominal performance (NP) is 
then 


NP © |wpeS|<1 Ww & |wp|<|1+L| Wo (7.53) 


|1 + L| represents at each frequency the distance of L(jw) from the point 
—1 in the Nyquist plot, so L(jw) must be at least a distance of |wp(jw)| from 
—1. This is illustrated graphically in Figure 7.15, where we see that for NP, 
L(jw) must stay outside a disc of radius |wp(jw)| centred on —1. 
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Figure 7.15: Nyquist plot illustration of nominal performance condition |wp| < 
1+ L| 


7.6.2 Robust Performance 


Figure 7.16: Diagram for robust performance with multiplicative uncertainty 


For robust performance we require the performance condition (7.53) to be 
satisfied for all possible plants, that is, including the worst-case uncertainty. 


RP & |wpSp| <1 VSp,Vw (7.54) 


& |wp|<|1+L,| VL, Ww (7.55) 


This corresponds to requiring |y/d| < 1VA; in Figure 7.16, where we consider 
multiplicative uncertainty, and the set of possible loop transfer functions is 


L, =G,K = L(1+ w7Ar) =L+wy,LAy;, (7.56) 


1. Graphical derivation of RP-condition. Condition (7.55) is 
illustrated graphically by the Nyquist plot in Figure 7.17. For RP we must 
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require that all possible L,(jw) stay outside a disc of radius |wp(jw)| centred 
on —1. Since L, at each frequency stays within a disc of radius w;L centred 
on L, we see from Figure 7.17 that the condition for RP is that the two discs, 
with radii wp and wy, do not overlap. Since their centres are located a distance 
|1+ L| apart, the RP-condition becomes 


RP © |wp|+|wrL|<|14+L|, Vw (7.57) 
& |we(l+L) 44 lwLbO4+L) <1, Ww (7.58) 


or in other words 


RP © max, (|wpeS|+|w7T|) <1 (7.59) 


Figure 7.17: Nyquist plot illustration of robust performance condition |wp| < 
|\1 + Lp| 


2. Algebraic derivation of RP-condition. From the definition in (7.54) 
we have that RP is satisfied if the worst-case (maximum) weighted sensitivity 
at each frequency is less than 1, that is, 


RP © max |wpSp| < 1, Vu (7.60) 


(strictly speaking, max should be replaced by sup). The perturbed sensitivity 
is S, = (I+ L,)~t = 1/(1+ L+ w/LAy), and the worst-case (maximum) is 
obtained at each frequency by selecting |A;|=1 such that the terms (1+ L) 
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and wr;LAy (which are complex numbers) point in opposite directions. We 
get 

url dw | 
J1+L|—|wrL| 1—|wrT} 


and by substituting (7.61) into (7.60) we rederive the RP-condition in (7.59). 


max |weS>| = (7.61) 


Remarks on RP-condition (7.59). 


1. The term (Nap) = |weS|+|wrT| in (7.59) is the structured singular value (1) 
for RP for this particular problem, see (8.128). We will discuss 4 in much more 
detail in the next chapter. 

2. Note that the structured singular value 4 is not equal to the worst-case weighted 
sensitivity, maxs, |wpSp|, given in (7.61) (although many people seem to think 
it is). The worst-case weighted sensitivity is equal to skewed- (u°) with fixed 
uncertainty for this problem, see Section 8.10.3. Thus, in summary we have for 
this particular robust performance problem: 


|wp S| 


——_ .62 
T—|wrTl a2) 


w= |weS|+|wiT|, w= 


Note that y and p° are closely related since  < 1 if and only if w* <1. 


3. The RP-condition (7.59) for this problem is closely approximated by the following 


mixed sensitivity 71. condition: 


[err | = max y/|wpS|? + |wrT |? <1 (7.63) 
wT w 


To be more precise, we find from (A.93) that condition (7.63) is within a factor of 
at most V2 to condition (7.59). This means that for SISO systems we can closely 
approximate the RP-condition in terms of an H.. problem, so there is little need 
to make use of the structured singular value. However, we will see in the next 
chapter that the situation can be very different for MIMO systems. 


4. The RP-condition (7.59) can be given a loop-shaping interpretation. At a given 


frequency we have that |wpS|+|wrT| <1 (RP) is satisfied if (see Exercise 7.4) 


1 
|L| > ame (at frequencies where |wr| < 1) (7.64) 
WI 
or if 
|L| < Delwe) (at frequencies where |wp| < 1) (7.65) 
1+ |wr| 


Conditions (7.64) and (7.65) may be combined over different frequency ranges. 
Condition (7.64) is most useful at low frequencies where generally |wr| < 1 and 
|jwp| > 1 (tight performance requirement) and we need |L| large. Conversely, 
condition (7.65) is most useful at high frequencies where generally |wr| > 1, 
(more than 100% uncertainty), |wp| < 1 and we need L small. The loop-shaping 
conditions (7.64) and (7.65) may in the general case be obtained numerically from 
p-conditions as outlined in Remark 13 on page 340. This is discussed by Braatz, 
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Morari and Skogestad (1995) who derive bounds also in terms of S and T, and 
also derive necessary bounds for RP in addition to the sufficient bounds in (7.64) 
and (7.65), see also Exercise 7.5. 


Exercise 7.4 Derive the loop-shaping bounds in (7.64) and (7.65) which are 
sufficient for |\weS|+|wrT| <1 (RP). Hint: Start from the RP-condition in the 
form |wp|+|wrL| < |1+L] and use the facts that |1+L| > 1—|L| and |1+L| > |L|-1. 


Exercise 7.5 Also derive, from |wpS|+|wrT| <1, the following necessary bounds 
for RP (which must be satisfied) 


—1 
1b] > PE. (for jwp| > 1 and jr] <1) 
ae I 
1—|wp| 
|L| < rene (for |wp| <1 and |w7| > 1) 
= 


Hint: Use |1+ L] <1+|L}. 


Example 7.8 Robust performance Problem. Consider robust performance of 


Figure 7.18: Diagram for robust performance in Example 7.8 


the SISO system in Figure 7.18, for which we have 


y 


d 


def 


RP & |#) <1, ViAu| <1, Ww; wr(s)=0.25+ —; wa(s)=Tru—— (7.66) 


a) Derive a condition for robust performance (RP). 

b) For what values of ry is it impossible to satisfy the robust performance 
condition? 

c) Let r, = 0.5. Consider two cases for the nominal loop transfer function: 1) 
GKi(s) =0.5/s and 2) GK2(s) = Tee For each system, sketch the magnitudes 
of S and its performance bound as a function of frequency. Does each system satisfy 
robust performance? 

Solution. a) The requirement for RP is |wpSp| <1, VSp,Vw, where the possible 
sensitivities are given by 


1 S 


wT OR Saas eas 


(7.67) 
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The condition for RP then becomes 


wes 
RP © ——— 1, VAu,V 7.68 
Tanase Vw (7.68) 
A simple analysis shows that the worst case corresponds to selecting Au with 
magnitude 1 such that the term w,A.S is purely real and negative, and hence we 
have 


RP eS jwpS|<1—|wiS|, Vw (7.69) 
oe jweS|+|wuS|<1, Vw (7.70) 
&  |S(jw)| < : Yu (7.71) 


lwp (jw)| + \wu(w)|? 


b) Since any real system is strictly proper we have |S| = 1 at high frequencies and 
therefore we must require |wu(jw)| + |we(jw)| < 1 as w > oo. With the weights 
in (7.66) this is equivalent to ru + 0.25 < 1. Therefore, we must at least require 
Tu <0.75 for RP, and RP cannot be satisfied if ry > 0.75. 


1 


10 1 
|S2| 1 
3, era aoe MELE Tl 
£10 F ieie 
g ue (S| 
= eae 
10°F : 
10° 2 iz zi i 2 
107 10” 10 10 10 
Frequency 


Figure 7.19: Robust performance test 


c) Design S1 yields RP, while Sx does not. This is seen by plotting the RP- 
condition (7.71) graphically as shown in Figure 7.19; |.Si| has a peak of 1 while |S2| 
has a peak of about 2.45. 


7.6.3 The relationship between NP, RS and RP 


Consider a SISO system with multiplicative uncertainty, and assume that the 
closed-loop is nominally stable (NS). The conditions for nominal performance, 
robust stability and robust performance can then be summarized as follows 


NP © — |wpS| <1,Ww (7.72) 
RS S lwrT| <1,Vw (7.73) 
RP © jweS|+|wrT| <1, Vw (7.74) 
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From this we see that a prerequisite for RP is that we satisfy NP and RS (i.e., 
RP => RS, and RP > NP). This applies in general, both for SISO and MIMO 
systems and for any uncertainty. In addition, for SISO systems, if we satisfy 
both RS and NP, then we have at each frequency 


|wpS| + |wrT| < 2max{|wpS]|,|wrT|} < 2 (7.75) 


It then follows that, within a factor of at most 2, we will automatically get 
RP when the subobjectives of NP and RS are satisfied (i.e., “NP + RS > 
RP within a factor 2”). Thus, RP is not a “big issue” for SISO systems, and 
this is probably the main reason why there is little discussion about robust 
performance in the classical control literature. On the other hand, as we shall 
see in the next chapter, for MIMO systems we may get very poor RP even 
though the subobjectives of NP and RS are individually satisfied. 

To satisfy RS we generally want T small, where as to satisfy NP we generally 
want S small. However, we cannot make both S and T small at the same 
frequency because of the identity $++T = 1. This has implications for RP, since 
jw p||S] + fwzlIT| > min{wel, fwrl}((S|-+ (TI), where |$|+ [7] >|S+ 7] =1, 
and we derive at each frequency 


|wpS| + |wrT| > min{|wp|,|wr|} (7.76) 


We conclude that we cannot have both |wp| > 1 (i.e. good performance) 
and |w;| > 1 (i.e. more than 100% uncertainty) at the same frequency. One 
explanation for this is that at frequencies where |w;| > 1 the uncertainty will 
allow for RHP-zeros, and we know that we cannot have tight performance in 
the presence of RHP-zeros. 


7.6.4 The similarity between RS and RP 


There is a strong similarity between RS with multiple perturbations and RP. 
To see this consider the following two cases as illustrated in Figure 7.20: 


(a) RP with multiplicative uncertainty 
(b) RS with combined multiplicative and inverse multiplicative uncertainty 


As usual the uncertain perturbations are normalized such that ||Ai||.. <1 
and ||/As||oo < 1. Since we use the H.. norm to define both uncertainty and 
performance and since the weights in (a) and (b) are the same, the tests for 
RP and RS in (a) and (b), respectively, are identical. This may be argued 
from the block diagrams, or by simply evaluating the conditions for the two 
cases as shown below. 


(a) The condition for RP with multiplicative uncertainty was derived above, 
but with w; replaced by wp and with wz replaced by wy. We found 


RP © |w,S|+|woT| <1, Ww (7.77) 
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Figure 7.20: (a) Robust performance with multiplicative uncertainty. 
(b) Robust stability with combined multiplicative and inverse multiplicative 
uncertainty. 


(b) We will now derive the RS-condition for the case where L, is stable (this 
assumption may be relaxed if the more general MA-method and the 
structured singular value are used). We want the system to be closed- 
loop stable for all possible A; and Ag. RS is equivalent to avoiding 
encirclements of -1 by the Nyquist plot of Lp. That is, the distance 
between L, and —1 must be larger than zero, i.e., |1 + L,| > 0, and 


therefore 
RS © |1+L,|>0 VL,,Vw (7.78) 
co |1 + Li + waAs)(1 = wiAi)}| > 0, VA, VAg, Vw (7.79) 
oS |1 +2 + Lwe As = w Ay4| > 0, VAy, VAs, Vw (7.80) 


Here the worst case is obtained when we choose A; and Ay with 
magnitudes 1 such that the terms Dw2Az2 and w;A, are in the opposite 
direction of the term 1+ L. We get 


B- -& TSE oieeloiiSt. wa Ga 
c— |w4 S| + |woT| < 1, Vw (7.82) 


which is the same condition (7.74) found for RP. 
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7.6.5 What we would really like to do — consider 
multiple perturbations! 


To get a better description of the uncertainty, in particular for MIMO systems 
and for parametric uncertainty, we need to use multiple perturbations (A’s). 
For example, we just considered the case with both multiplicative input 
uncertainty, Az, and inverse multiplicative output uncertainty, A,. In the 
general case, we will not be able to derive RS-conditions using simple algebra 
as above. However, we will see in the next chapter that the M A-structure 
together with the structured singular value provides a very general way of 
deriving necessary and sufficient RS-conditions even for complex cases. The 
extension to include RP is then straightforward because of the equivalence 
between RP and RS as discussed above. 


7.7 Examples of parametric uncertainty 


We now provide some further examples of how to represent parametric 
uncertainty. The perturbations A must be real to exactly represent parametric 
uncertainty. 


7.7.1 Parametric gain uncertainty 
We showed in Example 7.1 how to represent scalar parametric gain uncertainty 
G(s) = kpGo(s) where 

kmin < kp < imax (7.83) 


as multiplicative uncertainty G, = G(1+ w;A;) with nominal model G(s) = 
kGo(s) and uncertainty weight w; = rp = (kmax — kmin)/(Kmax + kmin)- Ar is 
a real scalar, —1 < A; <1. 

Inverse form. Alternatively, we can represent gain uncertainty as inverse 
multiplicative uncertainty: 


Tj; : G;,,(s) = G(s) + wir(s) Air); -1<Ajr <1 (7.84) 


with w;r =r, and G(s) = k;G where 


kminkmax 
k, = 2————_ ; 
imax + kmin i pa) 


Exercise 7.6 Derive (7.84) and (7.85). (Hint: The gain variation in (7.83) can be 
written exactly as kp = ki/(1—rpA).) Show that the form in (7.84) does not allow 
for kp =0 (even with A large), and discuss why this may be a possible advantage. 
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7.7.2. Parametric pole uncertainty 


Consider uncertainty in the parameter a in a state space model, y = ay + bu, 
corresponding to the uncertain transfer function Gp(s) = b/(s — ap). More 
generally, consider the following set of plants 
1 
G,(s) = Go (s); Qmin < Gp < Gmax (7.86) 


S— ap 


Tf @min and Gmax have different signs then this means that the plant can 
change from stable to unstable with the pole crossing through the origin 
(which happens in some applications). This set of plants can be written as 


G(s) 
s—@(1+r,A)’ 


Gp = ee (7.87) 
which can be exactly described by inverse multiplicative uncertainty as in 
(7.84) with nominal model G = Go(s)/(s — @) and 


Taga 


wir(s) = ae (7.88) 
The magnitude of the weight wir(s) is equal to rg at low frequencies. If rq is 
larger than 1 then the plant can be both stable and unstable. As seen from 
the RS-condition in (7.51), a value of |w;r| larger than 1 means that || must 
be less than 1 at the same frequency, which is consistent with the fact that 
we need feedback (.S small) to stabilize an unstable plant. 

It is also interesting to consider another form of pole uncertainty, namely 
that associated with the time constant: 


1 
G(s) = aan Gols); Tmin < Tp < Tmax (7.89) 
p 


This results in uncertainty in the pole location, but the set of plants is entirely 
different from that in (7.86). The reason is that in (7.86) the uncertainty 
affects the model at low frequency, whereas in (7.89) the uncertainty affects 
the model at high frequency. The corresponding uncertainty weight as derived 
in (7.7) is 
1,78 

Ce ae (7.90) 
This weight is zero at w = 0 and approaches r, at high frequency, whereas the 
weight wir in (7.88) is rq at w= 0 and approaches zero at high frequencies. 


7.7.3 Parametric zero uncertainty 


Consider zero uncertainty in the “time constant” form, as in 


G(s) = (ql + Tp8)G'o(s); Tmin < Tp < Tmax (7.91) 
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where the remaining dynamics Go(s) are as usual assumed to have no 
uncertainty. For example, let —1 < 7, < 3. Then the possible zeros z, = —1/Tp 
cross from the LHP to the RHP through infinity: z, < —1/3 (in LHP) and 
Zp > 1 (in RHP). The set of plants in (7.91) may be written as multiplicative 
(relative) uncertainty with 


wr(s) =7r;,7s/(1+7s) (7.92) 


The magnitude |w7(jw)| is small at low frequencies, and approaches r, (the 
relative uncertainty in 7) at high frequencies. For cases with r, > 1 we allow 
the zero to cross from the LHP to the RHP (through infinity). 


Exercise 7.7 Parametric zero uncertainty in zero form. 
Consider the following alternative form of parametric zero uncertainty 


Gp(s) = (s+ 2p)Go(s);  Zmin < Zp < Zmax (7.93) 


which caters for zeros crossing from the LHP to the RHP through the origin 
(corresponding to a sign change in the steady-state gain). Show that the resulting 
multiplicative weight is wr(s) = r.Z/(s+2Z) and explain why the set of plants given 
by (7.93) is entirely different from that with the zero uncertainty in “time constant” 
form in (7.91). Explain what the implications are for control if rz > 1. 


Remark. Both of the two zero uncertainty forms, (7.91) and (7.93) can occur in 
practice. An example of the zero uncertainty form in (7.93), which allows for changes 
in the steady-state gain, is given in Example 7.10. 


7.7.4 Parametric uncertainty in state-space form 


We here introduce the reader to parametric state-space uncertainty by way of 
a few examples. A general procedure for handling this kind of uncertainty is 
given by Packard (1988). Consider an uncertain state-space model 


£ = Apx+ Bou (7.94) 
= Cypr+ Du (7.95) 

or equivalently 
G,(s) = Cy(sI — Ap) By + Dp (7.96) 


Assume that the underlying cause for the uncertainty is uncertainty in some 
real parameters 61,62,... (these could be temperature, mass, volume, etc.), 
and assume in the simplest case that the state-space matrices depend linearly 
on these parameters i.e. 


Ap =A+)_6A;, Bp = B+) 4Bi Cp =C+) 6iCi, Dp = D+ 9° 5D; 
(7.97) 
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where A, B,C and D model the nominal system. This description has multiple 
perturbations, so it cannot be represented by a single perturbation, but it 
should be fairly clear that we can separate out the perturbations affecting 
A,B,C and D, and then collect them in a large diagonal matrix A with the 
real 6;’s along its diagonal. Some of the 6;’s may have to be repeated. For 
example, we may write 


A, = A+ )_6;A; = A+ WAW, (7.98) 


where A is diagonal with the 6;’s along its diagonal. Writing H(s) = (sI — 
A)~1, we get 


(sf — A,)~' = (sf — A— Wo AW,)7* = (I — H(s)W2AW,)7'H(s) (7.99) 


This is illustrated in the block diagram of Figure 7.21, which is in the form 
of an inverse additive perturbation (see Figure 8.5 (d)). 


(sI — A,)~4 


Figure 7.21: Uncertainty in state space A-matrix 


Example 7.9 Suppose Ap is a function of two parameters kp = 1+ w16, (-1 < 
61 <1) and ap =3 + wrdb2 (—1 < 62 < 1) as follows: 


— | —2—kp kip — Op 
Ap i Pon ks (7.100) 
Then 
= —3 —2 SW. Wy 0 —Ww2 
Ao iG 2 +61 wi ia +82 Ee 0 (10) 
_ $$ 
A Al Ag 
ear fe og 6 0 0 1 =] 
= A+ | : j 0 & O|]]1 0 (7.102) 
Wy 2wea 0 
0 0 8 0 1 
We 


A Wi 
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Note that 6; appears only once in A, whereas 62 needs to be repeated. This is related 
to the ranks of the matrices A, (which has rank 1) and Az (which has rank 2). 


Additional repetitions of the parameters 6; may be necessary if we also 
have uncertainty in B,C and D. It can be shown that the minimum number 
of repetitions of each 6; in the overall A-matrix is equal to the rank of each 


matrix [2 | (Packard, 1988; Zhou et al., 1996). Also, note that seemingly 


a 


nonlinear parameter dependencies may be rewritten in our standard linear 
block diagram form, for example, we can handle 6? (which would need 61 
repeated), oes etc. This is illustrated next by an example. 
Example 7.10 Parametric uncertainty and repeated perturbations. 

This example illustrates how most forms of parametric uncertainty can be 
represented in terms of the A-representation using linear fractional transformations 
(LFTs). Consider the following state space description of a SISO plant’ 


&=Apx+Bpu, y=Cau (7.103) 
Reeser of pal = |,e=i0 4 (7.104) 
Ap = ‘ Seg pte eS eee 1 


The constant k > 0 may vary during operation, so the above description generates 
a set of possible plants. Assume that k =0.5+0.1, which may be written as 


k=0.540.16, |6| <1 (7.105) 


Note that the parameter k enters into the plant model in several places, and we will 
need to use repeated perturbations in order to rearrange the model to fit our standard 
formulation with the uncertainty represented as a block-diagonal A-matriz. 

Let us first consider the input gain uncertainty for state 1, that is, the variations in 
byt = (1—k)/k. Even though bp: is a nonlinear function of 6, it has a block-diagram 


T This is actually a simple model of a chemical reactor (CSTR) where u is the feed flowrate, 
x1 is the concentration of reactant A, y = x2 is the concentration of intermediate product 
B and k = q® is the steady-state value of the feed flowrate. Component balances yield 


Véa =qeaz—9qcea—kicaV_ [mol A/s] 


Vép =—qep +kicaV —kocpgV_ [mol B/s] 


where V is the reactor volume. Linearization and introduction of deviation variables, 
a, = Aca,x%2 = Acg, and u = Aq, yields, with kj = 1,k2 =1,V =1 and cy, =cpz = 1, 


a1 (l+q*)z1 4 (Cay lu 


tg = 2, —(1+q*)a2—-—u 
where the superscript * signifies a steady-state value. The values of g* and Chr depend 
on the operating point, and it is given that at steady-state we always have Ten; = 


(physically, we may have an upstream mixing tank where a fixed amount of A is fed in). 
By introducing k = g* we get the model in (7.104). 
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representation and may thus be written as a linear fractional transformation (LFT). 


We have 
_1-k _ 05-016 — 1—0.26 


by (6 al 
(8) k 0.54016 1+0.26 C106) 
which may be written as a scalar LFT (see 3.106)) 

bpi(6) = Fu(N,6) = n22 + ni26(1 = ni16) > nat (7.107) 
with ne2 = 1, 11 = —0.2, niene1 = —0.4. Next consider the pole uncertainty caused 
by variations in the A-matriz, which may be written as 

—1.5 0 —0.1 0 6 0 
a | 1 Er +| 0 c | E ‘| 7-108) 


For our specific example with uncertainty in both B and A, the plant uncertainty 
may be represented as shown in Figure 7.22 where K(s) is a scalar controller. 
Consequently, we may pull out the perturbations and collect them in a3 x3 diagonal 
A-block with the scalar perturbation 6 repeated three times, 


AS 


6 
6 | (7.109) 
6 


and we may then obtain the interconnection matrix P by rearranging the block 
diagram of Figure 7.22 to fit Figure 3.20. It is rather tedious to do this by hand, but 
it is relatively straightforward with the appropriate software tools. These issues are 
discussed in more detail in the nezrt chapter. 


Figure 7.22: Block diagram of parametric uncertainty 


Remark. The above example is included in order to show that quite complex 
uncertainty representations can be captured by the general framework of block- 
diagonal perturbations. It is not suggested, however, that such a complicated 
description should be used in practice for this example. A little more analysis will 
show why. The transfer function for this plant is 


k+2.414)(k—-0.414 
(s ane ut ) 


(s+14+k) 


Gp(s) = (7.110) 
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and we note that it has a RHP-zero for k < 0.414, and that the steady state gain is 
zero for k = 0.414. The three plants corresponding to k = 0.5, 0.4 and 0.6 are 
(s +0.5) (s— 0.1) (s+ 0.93) 


Gs) = — Ta?” Gals) = — Caray Goals) = —Gaeye (7.111) 


From a practical point of view the pole uncertainty therefore seems to be of very 
little importance and we may use a simplified uncertainty description with zero 
uncertainty only, e.g. 


(8 + 2p) 
9p(8) = (e415) —0.1 < z <0.93 (7.112) 


In any case, we know that because of the RHP-zero crossing through the origin, the 
performance at low frequencies will be very poor for this plant. 


7.8 Additional exercises 


Exercise 7.8 Consider a “true” plant 


Be 01s 
(2s +1)(0.1s +1)? 


G(s) = 


i) Derive and sketch the additive uncertainty weight when the nominal model is 
G(s) = 3/(2s +1). ti) Derive the corresponding robust stability condition. iti) Apply 
this test for the controller K(s) = k/s and find the values of k that yield stability. 
Is this condition tight? 


Exercise 7.9 Uncertainty weight for a first-order model with delay. 
Laughlin and Morari (1989) considered the following parametric uncertainty 
description 

BB One hy Cae Wi ATi PR OS [Gases 
Ts t1 } ‘Pp min; Vmax]; /p min; ‘max ]}) Pp min; “max 

(7.113) 

where all parameters are assumed positive. They chose the mean parameter values 
as (k,0,7) giving the nominal model 


Gp(s) = 


YY A k —6s 
= = —_—_ 114 
G(s) = G(s) art (7.114) 
and suggested use of the following multiplicative uncertainty weight 
kmax TS+1 Tst+1 Omax — Omin 
=—.- . ee 11 
wrt(s) k tmings+1 —Ts+1 , 4 at) 


a) Show that the resulting stable and minimum phase weight corresponding to the 
uncertainty description in (7.16) is 


wrr(s) = (1.25s" + 1.555 + 0.2)/(2s + 1)(0.25s + 1) (7.116) 
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Note that this weight cannot be compared with (7.18) or (7.19) since the nominal 
plant is different. 

b) Plot the magnitude of wrr as a function of frequency. Find the frequency where 
the weight crosses 1 in magnitude, and compare this with 1/O0max. Comment on your 
answer. 

c) Find Ir(jw) using (7.14) and compare with |wrr|. Does the weight (7.116) 
and the uncertainty model (7.1) include all possible plants? (Answer: No, not quite 
around frequency w = 5). 


Exercise 7.10 Consider again the system in Figure 7.18. What kind of 
uncertainty might wy. and Ay represent? 


Exercise 7.11 Neglected dynamics. Assume we have derived the following 


detailed model 
3(—0.5s + 1) 


(25+ 1)(0.1s +1)? 
and we want to use the simplified nominal model G(s) = 3/(2s+1) with multiplicative 
uncertainty. Plot lr(w) and approximate it by a rational transfer function wr(s). 


Gaetail (s) = 


(7.117) 


7.9 Conclusion 


In this chapter we have shown how model uncertainty can be represented in 
the frequency domain using complex norm-bounded perturbations, ||Alloo. < 1. 
At the end of the chapter we also discussed how to represent parametric 
uncertainty using real perturbations. 

We showed that the requirement of robust stability for the case of 
multiplicative complex uncertainty imposes an upper bound on the allowed 
complementary sensitivity, |w,T| < 1, Vw. Similarly, the inverse multiplicative 
uncertainty imposes an upper bound on the sensitivity, |w;rS| < 1, Vw. We also 
derived a condition for robust performance with multiplicative uncertainty, 
|jwpS| + |wrT| <i, Vw. 

The approach in this chapter was rather elementary, and to extend the 
results to MIMO systems and to more complex uncertainty descriptions we 
need to make use of the structured singular value, yu. This is the theme of the 
next chapter, where we find that |w;T'| and |w;,5S| are the structured singular 
values for evaluating robust stability for the two sources of uncertainty in 
question, whereas |wpS|+|w7T| is the structured singular value for evaluating 
robust performance with multiplicative uncertainty. 


8 


ROBUST STABILITY AND 
PERFORMANCE ANALYSIS 


The objective of this chapter is to present a general method for analyzing robust 
stability and performance of MIMO systems with multiple perturbations. Our main 
analysis tool will be the structured singular value, 4. We also show how the “optimal” 
robust controller, in terms of minimizing jz, can be designed using D K-iteration. This 
involves solving a sequence of scaled Ho. problems. 


8.1 General control formulation with 
uncertainty 


subscript 

For useful notation and an introduction to model uncertainty the reader is 
referred to Sections 7.1 and 7.2. 

The starting point for our robustness analysis is a representation of the 
system in which the uncertain perturbations are collected together in a block- 
diagonal matrix of stable perturbations, 


Ay 


A= . hs (8.1) 


where each A; may represent a specific source of uncertainty, such as input 
uncertainty, Ar, or parametric uncertainty, 6;, where 6; is real. 

Tf we also “pull out” the controller kK’, then we may represent the the system 
as shown in Figure 8.1 where P is the generalized plant. This form is useful 
for controller synthesis. 

Alternatively, if the controller is given and we want to analyze the uncertain 
system, we use the NA-structure in Figure 8.2, where, as shown in (3.107), 
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Figure 8.1: General control configuration (for controller synthesis) 


Figure 8.2: NA structure for robust performance analysis 


N is related to P and K by a lower linear fractional transformation 
N = F(P,K) = Py + Pyro K (I — Po K) Py (8.2) 


The uncertain closed-loop transfer function from w to z (z = F(A)w) is 
related to N and A by an upper linear fractional transformation (LFT) (see 
(3.108)), 

F = F,(N,A) = Noo + No AT — Nn A) M2 (8.3) 


If we only need to analyze robust stability of F’, then we rearrange the system 


UA YA 


Figure 8.3: MA-structure for robust stability analysis 
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into M A-strucure in Figure 8.3 where M = Nj, is the transfer function from 
the output to the input of the perturbations. 

In Section 3.8.8 we discussed how to find P and N for cases without 
uncertainty. The procedure with uncertainty is similar and is illustrated by 
examples below, see page 319. To illustrate the main idea, consider Figure 8.4 
where it shown how to pull out the perturbation blocks to form the nominal 
system N and the block-diagonal perturbations A. 


8.2 Representing uncertainty 


As before, each individual perturbation is normalized, 
a(Aj(jw)) <1 Vw; |d;| <1 (8.4) 


and since from (A.46) the maximum singular value of a block diagonal matrix 
is equal to the largest of the maximum singular values of the individual blocks, 
it then follows that 


a(A(jw)) <1vw @ [lAlle <1 (8.5) 


Note that A has structure, and therefore in the robustness analysis we do not 
want to allow all A such that (8.5) is satisfied, but only the subset which has 
the structure in (8.1) should be considered. In some cases the blocks in A may 
be repeated, that is, we have additional structure. For example, as shown in 
Section 7.7.4, repetition is often needed to handle parametric uncertainty. 


Remark. The assumption about A being stable may be relaxed, but then the 
resulting robust stability and performance conditions will be harder to state and 
to prove. Furthermore, if we use the right form for the uncertainty and allow for 
multiple perturbations, then we can always generate the desired class of plants with 
stable perturbations, so assuming A stable is not really a restriction. 


8.2.1 Differences between SISO and MIMO systems 


The main difference between SISO and MIMO systems is the concept of 
directions which is not relevant in the former. As a consequence MIMO 
systems may experience much larger sensitivity to uncertainty than SISO 
systems. The following example illustrates that for MIMO systems it is 
sometimes critical to represent the coupling between uncertainty in different 
transfer function elements. 


Example 8.1 Coupling between transfer function elements. Consider a 
distillation process where at steady-state 


Be Ee —86.4 


108.2 —109.6 ee) 


I A=RGA(G) = | oe al 


—34.1 35.1 
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Ww z 
. 
(a) Original system with multiple perturbations 
Perea pay ogee el 
I I 
eh ! 
; : 
! ! 
I I 
Poo eee Se J 
Ww z 


N 


(b) Pulling out the perturbations 


Figure 8.4: Rearranging an uncertain system into the NA-structure 
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From the large RGA-elements we know that G becomes singular for small relative 
changes in the individual elements. For example, from (6.88) we know that perturbing 
the 1,2-element from —86.4 to —88.9 makes G singular. Since variations in the 
steady-state gains of +50% or more may occur during operation of the distillation 
process, this seems to indicate that independent control of both outputs is impossible. 
However, this conclusion is incorrect since, for a distillation process, G never 
becomes singular. The reason is that the transfer function elements are coupled 
due to underlying physical constraints (e.g., the material balance). Specifically, for 
the distillation process a more reasonable description of the gain uncertainty is 
(Skogestad et al., 1988) 


i iG 
a= a+w| §, ral 6] <1 (8.7) 


and for the numerical data above detGp = detG irrespective of 6, so Gp is never 
singular for this uncertainty. (Note that det G, = det G is not generally true for the 
uncertainty description given in (8.7).) 


Exercise 8.1 The uncertain plant in (8.7) may be represented in the additive 
uncertainty form Gp = G+ W2AaW, where Ay = 6 is a single scalar perturbation. 
Find Wi and Wo. 


8.2.2 Parametric uncertainty 


The representation of parametric uncertainty, as discussed in Chapter 7 for 
SISO systems, carries straight over to MIMO systems. However, the inclusion 
of parametric uncertainty may be more significant for MIMO plants because it 
offers a simple method of representing the coupling between uncertain transfer 
function elements. For example, the simple uncertainty description used in 
(8.7) originated from a parametric uncertainty description of the distillation 
process. 


8.2.3. Unstructured uncertainty 


Unstructured perturbations are often used to get a simple uncertainty model. 
We here define unstructured uncertainty as the use of a “full” complex 
perturbation matrix A, usually with dimensions compatible with those of the 
plant, where at each frequency any A(jw) satisfying ¢(A(jw)) < 1 is allowed. 

Six common forms of unstructured uncertainty are shown in Figure 8.5. 
In Figure 8.5 (a), (b) and (c), are shown three feedforward forms; additive 
uncertainty, multiplicative input uncertainty and multiplicative output 
uncertainty: 


Il,: Gp =G+ Ea; Eq = waka (8.8) 
II;: G, = G(I+ En); EF; =wyAyr (8.9) 
Ilo: Gp = (I+ Eo)G; Eo =woAo (8.10) 
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Figure 8.5: (a) Additive uncertainty, (b) Multiplicative input uncertainty, (c) 
Multiplicative output uncertainty, (d) Inverse additive uncertainty, (e) Inverse 
multiplicative input uncertainty, (f) Inverse multiplicative output uncertainty. 


In Figure 8.5 (d), (e) and (f), are shown three feedback or inverse forms; 
inverse additive uncertainty, inverse multiplicative input uncertainty and 
inverse multiplicative output uncertainty: 


Ta: Gp = G(r — Ei;aG)"}; Bia = wiaDia (8.11) 
Thr: Gp = G(r _ Eixr)}; a wir Air (8.12) 
Iljo: Gp = (I _— Eio) 1G; Eio = wiodso (8.13) 


The negative sign in front of the E’s does not really matter here since we will 
assume that A can have any sign. A denotes the normalized perturbation and 
E the “actual” perturbation. We have here used scalar weights w, so E = wA, 
but sometimes one may want to use matrix weights, FE = W2AW, where W, 
and W, are given transfer function matrices. 

Another common form of unstructured uncertainty is coprime factor 
uncertainty discussed later in Section 8.6.3. 


Remark. In practice, one can have several perturbations which themselves are 
unstructured. For example, we may have A; at the input and Ao at the output, 
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which may be combined into a larger perturbation, A = diag{A;, Ao}. However, 
this A is a block-diagonal matrix and is therefore no longer unstructured. 


Lumping uncertainty into a single perturbation 


For SISO systems we usually lump multiple sources of uncertainty into a single 
complex perturbation; often in multiplicative form. This may also be done for 
MIMO systems, but then it makes a difference whether the perturbation is at 
the input or the output. 

Since output uncertainty is usually less restrictive than input uncertainty 
in terms of control performance (see Section 6.10.4), we usually first attempt 
to lump the uncertainty at the output. For example, a set of plants II may be 
represented by multiplicative output uncertainty with a scalar weight wo(s) 
using 

G, = (I+wodAo)G, ||Aollo <1 (8.14) 


where, similar to (7.14), 


low) = max @ ((Gp— G)G™!(jw))); |wo(jw)| > lo(w) Vw (8.15) 


and we can use the pseudo-inverse if G is singular. If the resulting uncertainty 
weight is reasonable (i.e. it must at least be less than 1 in the frequency 
range where we want control), and the subsequent analysis shows that robust 
stability and performance may be achieved, then this lumping of uncertainty 
at the output is fine. If this is not the case, then one may try to lump the 
uncertainty at the input instead, using multiplicative input uncertainty with 
a scalar weight, 

G,=G(I+wrAr), ||Arllo <1 (8.16) 


where, similar to (7.14), 


(2) =a (G7"(Gp — G)(jw))); lwr(jw)|2U(~) vw (8.17) 
However, in many cases this approach of lumping uncertainty either at the 
output or the input does not work well. This is because one cannot in general 
shift a perturbation from one location in the plant (say at the input) to 
another location (say the output) without introducing candidate plants which 
were not present in the original set. In particular, one should be careful when 
the plant is ill-conditioned. This is discussed next. 


Moving uncertainty from the input to the output 


For a scalar plant, we have G, = G(1+w7;Az) = (1+ woAo)G and we 
may simply “move” the multiplicative uncertainty from the input to the 
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output without changing the value of the weight, i.e., wr = wo. However, 
for multivariable plants we usually need to multiply by the condition number 
7(G) as is shown next. 

As an example, suppose the true uncertainty is represented as input 
multiplicative uncertainty on the form 


G, =G(I+ Er) (8.18) 
Then from (8.17) the magnitude of multiplicative input uncertainty is 


Iw) = max a(G7! (Gy — G)) = max a(Erz) (8.19) 


Ey 


On the other hand, if we want to represent (8.18) as multiplicative output 
uncertainty, then from (8.15) 


lo(w) = max o((Gp —G)G')= max o(GE;G"") (8.20) 


which is much larger than /;(w) if the condition number of the plant is 
large. To see this, write Ey = w;A; where we allow any Ar(jw) satisfying 
a(Ar(jw)) < 1,Vw. Then at a given frequency 


lo(w) = |wr| max o(GArG~") = |wr(jw)| 1(G(sw)) (8.21) 
I 
Proof: Write at each frequency G = USV" and G7! = USV". Select Ar = VU# 


(which is a unitary matrix with all singular values equal to 1). Then ¢(GArG~') = 


a(USEV") = a(SSigma) = a(G)a(G~') = 7(G). 


Example 8.2 Assume the relative input uncertainty is 10%, that is, wr = 0.1, 
and the condition number of the plants is 141.7. Then we must select lo = wo = 
0.1 x 141.7 = 14.2 in order to represent this as multiplicative output uncertainty 
(this is larger than 1 and therefore not useful for controller design). 


Also for diagonal perturbations (E; diagonal) we may have a similar 
situation. For example, if the plant has large RGA-elements then the elements 
in GE;G~' will be much larger than those of E;, see (A.79, making it 
impractical to move the uncertainty from the input to the output. 


Example 8.3 Let II be the set of plants generated by the additive uncertainty in 
(8.7) with w = 10 (corresponding to about 10% uncertainty in each element). Then 
from (8.7) one plant G' in this set (corresponding to 6 =1) has 


3 10 —10 
G=G4 Ee a (8.22) 
for which we have ly = a(G~'(G' — G)) = 14.3. Therefore, to represent G' in terms 


of input uncertainty we would need a relative uncertainty of more than 1400%. This 
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would imply that the plant could become singular at steady-state and thus impossible 
to control, which we know is incorrect. Fortunately, we can instead represent this 
additive uncertainty as multiplicative output uncertainty (which is also generally 
preferable for a subsequent controller design) with lo = a((G’ — G)G~') = 0.10. 
Therefore so output uncertainty works well for this particular example. 


Conclusion. Ideally, we would like to lump several sources of uncertainty 
into a single perturbation to get a simple uncertainty description. Often an 
unstructured multiplicative output perturbations is used. However, from the 
above discussion we have learnt that we should be careful about doing this, at 
least for plants with a large condition number. In such cases we may have to 
represent the uncertainty as it occurs physically (at the input, in the elements, 
etc.) thereby generating several perturbations. 

For uncertainty associated with unstable plant poles, we should use one of 
the inverse forms in Figure 8.5. 


Figure 8.6: Uncertain plant, y = Gpu, represented by LFT, see (8.23). 


Exercise 8.2 A fairly general way of representing an uncertain plant Gy is in 
terms of a linear fractional transformation (LFT) of A as shown in Figure 8.6. 
Here 


Gy = Fu( ben ale = Pe + Pu AU — Fur A)7' Fis (8.23) 
where G = Fo is the nominal plant model. Obtain F for each of the siz 


uncertainty forms in (8.8)-(8.18) using E = W2AW, (Hint for the inverse forms: 
(I — Wi AWe)7! = 14+ Wi AU — WeWiA)~'Wae, see (3.7)-(8.9). 


Exercise 8.3 Find F in Figure 8.6 for the uncertain plant in Figure 7.22. 


8.2.4 Diagonal uncertainty 


By “diagonal uncertainty” we mean that the perturbation is a complex 
diagonal matrix 


A(s) = diag{dé;(s)}; |6;(jw)| < 1, Vw, Vi (8.24) 
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(usually of the same size as the plant). For example, this is the case if 
A is diagonal in any of the six uncertainty forms in Figure 8.5. Diagonal 
uncertainty arises from a consideration of uncertainty or neglected dynamics in 
the individual input channels (actuators) or in the individual output channels 
(sensors). This type of diagonal uncertainty is always present, and since it has 
a scalar origin it may be represented using the methods presented in Chapter 
7. To make this clearer, let us consider uncertainty in the input channels. 
With each input wu; there is associated a separate physical system (amplifier, 
signal converter, actuator, valve, etc.) which based on the controller output 
signal, u;, generates a physical plant input m; 


The scalar transfer function h;(s) is often absorbed into the plant model G(s), 
but for representing the uncertainty it is important to notice that it originates 
at the input. We can represent this actuator uncertainty as multiplicative 
(relative) uncertainty given by 


hpi(s) = hi(s)(1+ wri(s)di(s));  [6i(Gw)| <1, Vw (8.26) 


which after combining all input channels results in diagonal input uncertainty 
for the plant 


Gp(s) =G(1+WyA;z); Ar = diag{é;}, Wr = diag{wr} (8.27) 


Normally we would represent the uncertainty in each input or output channel 
using a simple weight in the form given in (7.27), namely 
TS+Y1O 
w(s) = ————_ 8.28 
(s) (t/Too)s +1 ee) 
where 79 is the relative uncertainty at steady-state, 1/7 is (approximately) 
the frequency where the relative uncertainty reaches 100%, and ro is the 
magnitude of the weight at higher frequencies. Typically, the uncertainty |w], 
associated with each input, is at least 10% at steady-state (79 > 0.1), and it 
increases at higher frequencies to account for neglected or uncertain dynamics 
(typically, roo > 2). 


Remark 1 The diagonal uncertainty in (8.27) originates from independent scalar 
uncertainty in each input channel. If we choose to represent this as unstructured 
input uncertainty (Ar is a full matrix) then we must realize that this will introduce 
non-physical couplings at the input to the plant, resulting in a set of plants which is 
too large, and the resulting robustness analysis may be conservative (meaning that 
we may incorrectly conclude that the system may not meet its specifications). 


Remark 2 The claim is often made that one can easily reduce static (gain) input 
uncertainty to significantly less than 10%, but this is probably not true in most 
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cases. Consider again (8.25). A commonly suggested method to reduce static gain 
uncertainty is to measure the actual input (m:) and employ local feedback (cascade 
control) to readjust u:. As a simple example, consider a bathroom shower, in which 
the input variables are the flows of hot and cold water. One can then imagine 
measuring these flows and using cascade control so that each flow can be adjusted 
more accurately. However, even in this case there will be uncertainty related to the 
accuracy of each measurement. Note that it is not the absolute measurement error 
that yields problems, but rather the error in the sensitivity of the measurement 
with respect to changes (i.e., the “gain” of the sensor). For example, assume that 
the nominal flow in our shower is 1 1/min and we want to increase it to 1.1 1/min, 
that is, in terms of deviation variables we want u = 0.1 [l/min]. Suppose the vendor 
guarantees that the measurement error is less than 1%. But, even with this small 
absolute error, the actual flow rate may have increased from 0.99 1/min (measured 
value of 1 1/min is 1% too high) to 1.11 1/min (measured value of 1.1 1/min is 1% 
too low), corresponding to a change u’ = 0.12 [l/min], and an input gain uncertainty 
of 20%. 


In conclusion, diagonal input uncertainty, as given in (8.27), should always be 
considered because: 


1. It is always present and a system which is sensitive to this uncertainty will 
not work in practice. 
2. It often restricts achievable performance with multivariable control. 


8.3. Obtaining P, N and M 


We here illustrate, by way of an example, how to obtain the interconnection 
matrices P, N and M in a given situation. 


Figure 8.7: System with multiplicative input uncertainty and performance 
measured at the output 


Example 8.4 System with input uncertainty. Consider a feedback system 
with multiplicative input uncertainty A; as shown in Figure 8.7. Here Wy is a 
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normalization weight for the uncertainty and Wp is a performance weight. We want 
to derive the generalized plant P in Figure 8.1 which has inputs [ua w ule and 
outputs [ya 2 v]". By writing down the equations (e.g. see Example 3.13) or 
simply inspecting Figure 8.7 (remember to break the loop before and after K ) we get 


0 0 Ww 
P=|WpG Wp WeG (8.29) 
eG” of, Tag 


It is recommended that the reader carefully derives P (as instructed in Exercise 8.4). 
Note that the transfer function from ua to ya (upper left element in P) is 0 because 
ua has no direct effect on ya (except through K ). Next, we want to derive the matriz 
N corresponding to Figure 3.21. First, partition P to be compatible with K, i.e. 


= 0 0 _ | Wr 
Pus Ee es , Peas Ee (8.30) 
Px =[-G —I], Pr G (8.31) 


and then find N = F(P, K) using (8.2). We get (see Exercise 8.6) 
N= -WrKG(I+ KG)"! -WrK(I1+GKk)7! (8.32) 
~ | WeGU+KkG) Wpe(I+GkK)"! ; 


Alternatively, we can derive N directly from Figure 8.7 by evaluating the closed-loop 
transfer function from inputs [ua w]" to outputs[ya 2] (without breaking the 
loop before and after K.) For example, to derive Ni2, which is the transfer function 
from w to ya, we start at the output (ya) and move backwards to the input (w) 
using the MIMO Rule in Section 3.2 (we first meet Wr, then —K and we then exit 
the feedback loop and get the term (I+ GK)7'). 

The upper left block, Nii, in (8.82) is the transfer function from ua to ya. This 
is the transfer function M needed in Figure 8.3 for evaluating robust stability. Thus, 
we have M = —W;KG(I4+ KG)~' = —WyrTr. 


Exercise 8.4 Show in detail how P in (8.29) is derived. 


Exercise 8.5 For the system in Figure 8.7 we see easily that the uncertain transfer 
function from w to z is F = We(L+G(I+ WrAr)K)~'. Show that this is identical 
to Fu(N, A) evaluated using (8.35) where from (8.32) we have Ni = —WrT7, 
Ni2 = —WrKS, Noi =WeSG and Noz = WpS. 


Exercise 8.6 Derive N in (8.32) from P in (8.29) using the lower LFT in (8.2). 
You will note that the algebra is quite tedious, and that it is much simpler to derive 
N directly from the block diagram as described above. 


Remark. Of course, deriving N from P is straightforward using available software. 
For example, in the MATLAB p-toolbox we can evaluate N = Fi(P, K) using the 
command N=starp(P,K) , and with a specific A the perturbed transfer function 
F(N, A) from w to z is obtained with the command starp(delta,N). 
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Exercise 8.7 Derive P and N for the case when the multiplicative uncertainty is 
at the output rather than the input. 


Exercise 8.8 Find P for the uncertain system in Figure 7.18. 


Exercise 8.9 Find P for the uncertain plant Gp in (8.23) when w = r and 
zZ=y-r. 


Exercise 8.10 Find the interconnection matrix N for the uncertain system in 
Figure 7.18. What is M ? 


Exercise 8.11 Find the transfer function M = Nii for studying robust stability 
for the uncertain plant Gp in (8.23). 


Figure 8.8: System with input and output multiplicative uncertainty 


Exercise 8.12 MA-structure for combined input and output uncertain- 
ties. Consider the block diagram in Figure 8.8 where we have both input and output 
multiplicative uncertainty blocks. The set of possible plants is given by 


Gy = (I+ W20 AoW10) G(T + War ArWir) (8.33) 


where ||Arllo < 1 and ||Aollo < 1. Collect the perturbations into A = 
diag{Ar, Ao} and rearrange Figure 8.8 into the MA-structure in Figure 8.3 Show 
that 


Wir 0 -T; —-KS War 0 
w=| lee ae 0 rm oH 


8.4 Definition of robust stability and 
performance 
We have discussed how to represent an uncertain set of plants in terms of the 


NA-structure in Figure 8.2. The next step analysis where we check whether 
we have stability and acceptable performance for all plants in the set: 
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1. Robust stability (RS) analysis: with a given controller K we determine 
whether the system remains stable for all plants in the uncertainty set. 

2. Robust performance (RP) analysis: if RS is satisfied, we determine how 
“large” the transfer function from exogenous inputs w to outputs z may 
be for all plants in the uncertainty set. 


Before proceeding we need to define more performance more precicely. 
In Figure 8.2, w represents the exogenous inputs (normalized disturbances 
and references), and z the exogenous outputs (normalized errors). We have 
z = F(A)w where from (8.3) 


F 2 F,(N, A) = No + Nu AI — Nu A) Nie (8.35) 


We here use the H,. norm to define performance and require for RP that 
|F(A)|lo < 1 for all allowed A’s. A typical choice is F = wpS,, the 
weighted sensitivity function, where wp is the performance weight (capital P 
for performance) and S, represents the set of perturbed sensitivity functions 
(lower-case p for perturbed). 

In terms of the NA-structure in Figure 8.2 our requirements for stability 
and performance can then be summarized as follows 


NS © Nis internally stable (8.36) 
NP & |[Noplloo <1; and NS (8.37) 
RS & F=F,(N,A) is stable VA,||Alloo <1; andNS (8.38) 
RP & |[Fllo <1, WA,||Alloo <1; and NS (8.39) 


Remark 1 Important. As a prerequisite for nominal performance (NP), robust 
stability (RS) and robust performance (RP), we must first satisfy nominal stability 
(NS). The reason is that our frequency-by-frequency conditions can be satisfied 
also for unstable systems. 


Remark 2 The definitions of RS and RP are useful only if we can test them in 
an efficient manner, that is, without having to search through the infinite set of 
allowable perturbations A. We will show how this can be done by introducing 
the structured singular value, 4, as our analysis tool. At the end of the chapter 
we also discuss how to synthesize controllers such that we have “optimal robust 
performance” by minimizing yp over the set of stabilizing controllers. 


Remark 3 Convention for inequalities. In this book we use the convention 
that the perturbations are bounded to be less than or equal to one. This results 
in a stability condition with a strict inequality, for example, RS V||Al|.. < 1 if 
|| M|lo. < 1. (We could alternatively have bounded the uncertainty with a strict 
inequality, yielding the equivalent condition RS V||Alloo < 1 if ||M]loo <1.) 
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Remark 4 Allowed perturbations. For simplicity below we will use the 
shorthand notation 
VA and max (8.40) 


to mean “for all A’s in the set of allowed perturbations”, and “maximizing over all 
A’s in the set of allowed perturbations”. By allowed perturbations we mean that A 
has Ho. norm less than 1, ||A||.. < 1, and that A has a specified block-diagonal 
structure where certain blocks may be restricted to be real. To be mathematically 
exact, we should replace A in (8.40) by A € Ba, where 


Ba ={A€EA: ||Alloo <1} 


is the set of unity norm-bounded perturbations with a given structure A. The 
allowed structure should also be defined, for example by 


A= {diag [O1lri,..-,6sIrg, At, a ., Ar] : 63 € R, A; E ci xm 


where in this case S denotes the number of real scalars (some of which may 
be repeated), and F the number of complex blocks. This gets rather involved. 
Fortunately, this amount of detail is rarely required as it is usually clear what we 
mean by “for all allowed perturbations” or “VA”. 


8.5 Robust stability of the MA-structure 


Consider the uncertain NA-system in Figure 8.2 for which the transfer 
function from w to z is, as in (8.35), given by 


FCN, A) = Noo + No, A(T = Ny A) Ny. (8.41) 


Suppose that the system is nominally stable (with A = 0), that is, N is stable 
(which means that the whole of N, and not only Noo must be stable ). We 
also assume that A is stable. We then see directly from (8.41) that the only 
possible source of instability is the feedback term (I — Ni,A)~1. Thus, when 
we have nominal stability (NS), the stability of the system in Figure 8.2 is 
equivalent to the stability of the MA-structure in Figure 8.3 where M = Ni. 

We thus need to derive conditions for checking the stability of the MA- 
structure. The next theorem follows from the generalized Nyquist Theorem 
4.14. It applies to H. norm-bounded A-perturbations, but as can be seen 
from the statement it also applies to any other convex set of perturbations 
(e.g., sets with other structures or sets bounded by different norms). 


Theorem 8.1 Determinant stability condition (Real or complex 
perturbations). Assume that the nominal system M(s) and the 
perturbations A(s) are stable. Consider the conver set of perturbations A, 
such that if A’ is an allowed perturbation then so is cA’ where c is any real 
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scalar such that |c| <1. Then the MA-system in Figure 8.3 is stable for all 
allowed perturbations (we have RS) if and only if 


Nyquist plot of det (I — MA(s)) does not encircle the origin, VA (8.42) 
= |det(I—-MA(jw)) #0, Vw,VA (8.43) 

& (MA)#1, Vi,vw,VA (8.44) 

Proof: Condition (8.42) is simply the generalized Nyquist Theorem (page 158) 


applied to a positive feedback system with a stable loop transfer function MA. 

(8.42) = (8.43): This is obvious since by “encirclement of the origin” we also 
include the origin itself. 

(8.42) < (8.43) is proved by proving not(8.42) > not(8.43): First note that with 
A = 0, det(J — MA) = 1 at all frequencies. Assume there exists a perturbation 
A’ such that the image of det(J — MA'(s)) encircles the origin as s traverses the 
Nyquist D-contour. Because the Nyquist contour and its map is closed, there then 
exists another perturbation in the set, A” = cA’ with « € [0,1], and an w’ such that 
det (I — MA" (jw')) =0. 

(8.44) is equivalent to (8.43) since det(I — A) = J], \s(J — A) and As(J — A) = 
1 — \;(A) (see Appendix A.2.1). 


The following is a special case of Theorem 8.1 which applies to complex 
perturbations. 


Theorem 8.2 Spectral radius condition for complex perturbations. 

Assume that the nominal system M(s) and the perturbations A(s) are 
stable. Consider the class of perturbations, A, such that if A’ is an allowed 
perturbation then so is cA’ where c is any complex scalar such that |c| < 1. 
Then the MA- system in Figure 8.8 is stable for all allowed perturbations (we 
have RS) if and only if 


o(MA(jw)) <1, Ww, VA (8.45) 


or equivalently 
RS © max p(MA(jw)) <1, Ww (8.46) 


Proof: (8.45) = (8.43) (@ RS) is “obvious”: It follows from the definition of the 
spectral radius p, and applies also to real A’s. 

(8.43) = (8.45) is proved by proving not(8.45) => not(8.43): Assume there exists 
a perturbation A’ such that p(MA’) = 1 at some frequency. Then |\;(MA’)| = 1 for 
some eigenvalue i, and there always exists another perturbation in the set, A” = cA’ 
where c is a complex scalar with |c| = 1, such that \;(M A") = +1 (real and positive 
and therefore det(J — MA") = JJ, (I — MA") = J],(1 — As(MA")) = 0. Finally, 
the equivalence between (8.45) and (8.46) is simply the definition of maxa. 


Remark 1 The proof of (8.45) relies on adjusting the phase of \;(McA’) using the 
complex scalar c and thus requires the perturbation to be complex. 
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Remark 2 In words, Theorem 8.2 tells us that we have stability if and only if 
the spectral radius of MA is less than 1 at all frequencies and for all allowed 
perturbations, A. The main problem here is of course that we have to test the 
condition for an infinite set of A’s, and this is difficult to check numerically. 


Remark 3 Theorem 8.1, which applies to both real and complex perturbations, 
forms the basis for the general definition of the structured singular value in (8.72). 


8.6 RS for complex unstructured uncertainty 


In this section, we consider the special case where A(s) is allowed to be any 
(full) complex transfer function matrix satisfying ||A||.. < 1. This is often 
referred to as unstructured uncertainty or as full-block complex perturbation 
uncertainty. 


Lemma 8.3 Let A be the set of all complex matrices such that (A) < 1. 
Then the following holds 


max p(MA) = max a(MA) = max a(A)a(M) =a(M) (8.47) 


Proof: In general, the spectral radius (p) provides a lower bound on the spectral 
norm (a) (see (A.114)), and we have 


max p(M A) < max (MA) < max a(A)o(M) =a(M) (8.48) 


where the second inequality in (8.48) follows since (AB) < a(A)a(B). Now, we need 
to show that we actually have equality. This will be the case if for any MM there exists 
an allowed A’ such that p(MA’) = a(M). Such a A’ does indeed exist if we allow 
A’ to be a full matrix such that all directions in A’ are allowed: Select A’ = VU" 
where U and V are matrices of the left and right singular vectors of M = UNV". 
Then a(A’) = 1 and p(MA’) = p(UUSV" VU") = p(USU") = p(©) = G(M). The 
second to last equality follows since U” = U~' and the eigenvalues are invariant 
under similarity transformations. 


that is, 
Lemma 8.3 together with Theorem 8.2 directly yields the following theorem: 


Theorem 8.4 RS for unstructured (“full”) perturbations. Assume 
that the nominal system M(s) is stable (NS) and that the perturbations A(s) 
are stable. Then the MA-system in Figure 8.3 is stable for all perturbations 
A satisfying ||Allo <1 (i.e. we have RS) if and only if 


a(M(jw))<1 Vw = {||Mllo<1 (8.49) 
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Remark 1 Condition (8.49) may be rewritten as 
RS © a(M(jw)) ¢(A(jw)) <1, Vw, VA, (8.50) 


The sufficiency of (8.50) (=>) also follows directly from the small gain theorem by 
choosing L = MA. The small gain theorem applies to any operator norm satisfying 
|ABl| < |All IIL 


Remark 2 An important reason for using the 7H. norm to describe model 
uncertainty, is that the stability condition in (8.50) is both necessary and sufficient. 
In contrast, use of the 72 norm yields neither necessary nor sufficient conditions for 
stability. We do not get sufficiency since the H2 norm does not in general satisfy 
ABI < [All IBIl 


8.6.1 Smallest destabilizing perturbation 


Given ||M||o = y where a(M(jwo)) = 7, we want to find the smallest 
destabilizing perturbation A’, which satisfies ||A’||.. = 1/7. To generate 
such a perturbation follow the proof of Lemma 8.3. At frequency wo perform 
an SVD of M(jwo) = UNV". One destabilizing perturbation is then A’ = 
svU", which yields M(jw)A’ = SUDU" which, since the eigenvalues are 
independent of similarity transformations, has the same eigenvalues as the 
matrix 4¥, which has one eigenvalue equal to 1, ie., \1(M(jwo)A’) = 1. 
Note that the destabilizing perturbation is not unique. For example, one may 
instead select A’ = $1 (ui) where v and uy are the fist columns of V and 
U, respectively. 


0 2 


Example 8.5 Consider M, = Fe 0 


for which (M1) =y =2. An SVD of Mi 


yields U =I and V = [‘ a and we find the smallest destabilizing perturbation 
AS foi(u)” = (9 Al which has a(A') = 0.5. Note that p(Mi) = 0, but 
MA’ = F al so we have as expected p(M1A') = 1. 


8.6.2 Application of the unstructured RS-condition 


We will now present necessary and sufficient conditions for robust stability 
(RS) for each of the six single unstructured perturbations in Figure 8.5. with 


E=WAWy, Allo <1 (8.51) 


To derive the matrix M we simply “isolate” the perturbation, and determine 
the transfer function matrix 


M =WiMW> (8.52) 
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from the output to the input of the perturbation, where Mo for each of the 
six cases becomes (disregarding some negative signs which do not affect the 
subsequent robustness condition) is given by 


Gp=G+Ea: Mo=K(I+GK)'=KS (8.53 
G)=GUI+Er: My=K(I+GK)'G=T; (8.54 
Gp=(I+Fo)G: Mo =GK(I+GK)1=T (8.55 

G,=GI-EaG)': Mo=(U+GK)"'G=SG (8.56 
G,p=G(I-Ejyz)t: Mo=(1+KG)1=S (8.57 
G,=(1- Eijo) 'G: Mo =(14+GK)'=8S (8.58 


For example, (8.54) and (8.55) follow from the diagonal elements in the M- 
matrix in (8.34), and the others are derived in a similar fashion. Note that 
the sign of Mo does not matter as it may be absorbed into A. Theorem 8.4 
then yields 

RS &  |WiMoW2(jw)|lo < 1,V w (8.59) 


For instance, from (8.54) and (8.59) we get for multiplicative input uncertainty 
with a scalar weight: 


RS VG, = G(I + wrAz), ||Arlloo <1 & |lwrFrlloo <1 (8.60) 


Note that the SISO-condition (7.32) follows as a special case of (8.60) 
Similarly, (7.51) follows as a special case of the inverse multiplicative output 
uncertainty in (8.58): 
RS VG = (I = wiodio) 'G, |Asolloo <1s |]wioS||loo <1 (8.61) 
In general, the unstructured uncertainty descriptions in terms of a single 
perturbation are not “tight” (in the sense that at each frequency all complex 
perturbations satisfying a(A(jw)) < 1 may not be possible in practice). 
Thus, the above RS-conditions are often conservative. In order to get tighter 
conditions we must use a tighter uncertainty description in terms of a block- 
diagonal A. 


8.6.3. RS for coprime factor uncertainty 


Robust stability bounds in terms of the H. norm (RS © ||M|lo < 1) 
are in general only tight when there is a single full perturbation block. An 
“exception” to this is when the uncertainty blocks enter or exit from the same 
location in the block diagram, because they can then be stacked on top of 
each other or side-by-side, in an overall A which is then a full matrix. If we 
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Figure 8.9: Coprime uncertainty. 


norm-bound the combined (stacked) uncertainty, we then get a tight condition 
for RS in terms of ||M loo. 

One important uncertainty description that falls into this category is the 
coprime uncertainty description shown in Figure 8.9, for which the set of 
plants is 


Gp =(Mi+Am)'(Ni+An), ||[An Ant] lloo <€ (8.62) 


where G = M,N is a left coprime factorization of the nominal plant, see 
(4.20). This uncertainty description is surprisingly general, it allows both zeros 
and poles to cross into the right-half plane, and has proved to be very useful 
in applications (McFarlane and Glover, 1990). Since we have no weights on 
the perturbations, it is reasonable to use a normalized coprime factorization 
of the nominal plant; see (4.25). Also note that we choose not to normalize the 
perturbations to be less than 1 in this case. This is because this uncertainty 
description is most often used in a controller design procedure where the 
objective is to maximize the magnitude of the uncertainty (€) such that RS is 
maintained. 

In any case, to test for RS we can rearrange the block diagram to match 
the MA-structure in Figure 8.3 with 


Re ee eS 7 | (1+GK)1M (8.63) 
We then get from Theorem 8.4 
RS Vi|An Aasllo<e © ||Mlo < 1/e (8.64) 


The above robust stability result is central to the #1. loop-shaping design 
procedure discussed in Chapter 9. 
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Remark. In (8.62) we bound the combined (stacked) uncertainty, ||[ Ay An] |lo < 
€, which is not quite the same as bounding the individual blocks, ||Aw|loo < 1 and 

\|Aar|loc < 1. However, from (A.44) we see that these two approaches differ at most 

by a factor of /2, so it is not an important issue from a practical point of view. 


Exercise 8.13 Consider combined multiplicative and inverse multiplicative 
uncertainty at the output, Gp = (I-AioWio) (1+ AoWo)G, where we choose to 
norm-bound the combined uncertainty, ||[Aio Ao]|loo <1. Make a block diagram 
of the uncertain plant, and derive a necessary and sufficient condition for robust 
stability of the closed-loop system. 


8.7 RS with structured uncertainty: 
Motivation 


SAME UNCERTAINTY 


NEW M: DMD-! 


Figure 8.10: Use of block-diagonal scalings, AD = DA. 


Consider now the presence of structured uncertainty, where A = diag{A;} is 
block-diagonal. To test for robust stability we rearrange the system into the 
M A-structure and we have from (8.49) 


RS if o(M(jw)) <1,Vw (8.65) 


We have here written “if” rather than “if and only if” since this condition is 
only sufficient for RS when A has “no structure” (full-block uncertainty). The 
question is whether we can take advantage of the fact that A = diag{A;} is 
structured to obtain an RS-condition which is tighter than (8.65). One idea is 
to make use of the fact that stability must be independent of scaling. To this 
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effect, introduce the block-diagonal scaling matrix 
D = diag{d;I;} (8.66) 


where d; is a scalar and J; is an identity matrix of the same dimension as the 
ith perturbation block, A;. Now rescale the inputs and outputs to M and A 
by inserting the matrices D and D~! on both sides as shown in Figure 8.10. 
This clearly has no effect on stability. Next, note that with the chosen form 
for the scalings we have for each perturbation block A; = d; Aid; *, that. is, 
we have A = DAD-—!. This means that (8.65) must also apply if we replace 
M by DMD~! (see Figure 8.10), and we have 


RS if o(DMD"') <1,VWw (8.67) 


This applies for any D in (8.66), and “most improved” (least conservative) 
RS-condition is obtained by minimizing at each frequency the scaled singular 
value, and we have 


RS if min &(D(w)M(jw)D(w)7') <1,Ww (8.68) 
D(w)ED 


where D is the set of block-diagonal matrices whose structure is compatible 
to that of A, ie, AD = DA. We will return with more examples of this 
compatibility later. Note that when A is a full matrix, we must select D = dI 
and we have ¢(DMD~') = a(M), and so as expected (8.68) is identical to 
(8.65). However, when A has structure, we get more degrees of freedom in D, 
and (DM D~*) may be significantly smaller than o(M). 


Remark 1 Historically, the RS-condition in (8.68) directly motivated the 
introduction of the structured singular value, 4(M/), discussed in detail in the next 
section. As one might guess, we have that p(M) < minno(DMD~'). In fact, for 
block-diagonal complex perturbations we generally have that (MM) is very close to 
minp o(DMD~'). 


Remark 2 Other norms. Condition (8.68) is essentially a scaled version of the 

small gain theorem. Thus, a similar condition applies when we use other matrix 
norms. The MA-structure in Figure 8.3 is stable for all block-diagonal A’s which 
satisfy ||A(jw)|| < 1, Vw if 


min ||D(w)M(jw)D(w)7' || < 1, Vw (8.69) 
D(w)ED 

where D as before is compatible with the block-structure of A. Any matrix norm may 
be used; for example, the Frobenius norm, ||M/||r, or any induced matrix norm such 
as ||M||i1 (maximum column sum), ||/||ioo (maximum row sum), or ||M/|li2 = (1), 
which is the one we will use. Although in some cases it may be convenient to use 
other norms, we usually prefer ¢ because for this norm we get a necessary and 
sufficient RS-condition. 
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8.8 The structured singular value and RS 


The structured singular value (denoted Mu, mu, SSV or y) is a function which 
provides a generalization of the singular value, g, and the spectral radius, p. 
We will use ys to get necessary and sufficient conditions for robust stability 
and also for robust performance. The name “structured singular value” is 
used because ys generalizes the singular value RS-condition, ¢(M) < 1, Vw in 
(8.49), to the case when A has structure (and also to cases where parts of A 
are real). 

How is p defined? Consider stability of the MA-structure in Figure 8.3 for 
the case where A is a set of norm-bounded block-diagonal perturbations. From 
the determinant stability condition in (8.43) which applies to both complex 
and real perturbations we get 


RS #  det(I—- MA(jw)) #0, Vw,VA,a(A(jw))<1 Vw (8.70) 


A problem with (8.70) is that it is only a “yes/no” condition. To find the 
factor ky, which the system is robustly stable, we scale the uncertainty A by 
km and look for the smallest k,, which makes the matrix I — k,, MA singular, 
i.e. 


det (I — km MA) = 0 (8.71) 
and the structured singular value, jy, is simply defined as 1/km. 


Definition 8.5 Structured Singular Value. Let M be a given complez 
matriz and let A = diag{A;} denote a set of complex matrices with a(A) <1 
and with a given block-diagonal structure (in which some of the blocks may be 
repeated and some may be restricted to be real). The real non-negative function 
(MM), called the structured singular value, is defined by 


1 
min{k,,| det (I — ky, MA) = 0 for structured A,a(A) < 1} 


> 


(8.72) 


If no such structured A exists then u(M) =0. 


It is important to note that the value of u(M/) depends also on the structure 
of A. This is sometimes shown explicitly by using the notation wa(M). A 
value of 4s = 1 means that there exists a perturbation with (A) = 1 which 
is just large enough to make J — MA singular. A larger value of pz is “bad” 
as it means that a smaller perturbation makes J — MA singular, whereas a 
smaller value of 1: is “good”. 

Combining (8.70) with the definition of yz yields a necessary and sufficient 
condition for robust stability. 


Theorem 8.6 RS for block-diagonal perturbations (real or com- 
plex). Assume that the nominal system M and the perturbations A are 
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stable. Then the MA-system in Figure 8.3 is stable for all allowed perturba- 
tions with (A) <1, Vw, if and only if 


b(M(jw)) <1, Vw (8.73) 


Proof: u(M) < 1 km > 1, so if u(M) < 1 at all frequencies the required 
perturbation A to make det(J — MA) = 0 is larger than 1, and the system is 
stable. On the other hand, u(M) =1< km = 1, so if w(M) = 1 at some frequency 
there does exist a perturbation with (A) = 1 such that det(J — MA) = 0 at this 
frequency, and the system is unstable. 


Condition (8.73) for robust stability may be rewritten as 
RS © p(M(jw)) e(AGjw)) <1, Vw (8.74) 


which may be interpreted as a “generalized small gain theorem” that also 
takes into account the structure of A. 

One may argue whether Theorem 8.6 is really a theorem, or a restatement 
of the definition of js. In either case, we see from (8.73) that it is trivial to 
check for robust stability provided we can compute p. The focus of the next 
section is therefore to understand what yp is and how it can be computed. 

However, let us first consider two examples that illustrate how we use pu to 
check for robust stability with structured uncertainty. In the first example, the 
structure of the uncertainty is important, and an analysis based on the Hy 
norm would leads to the incorrect conclusion that the system is not robustly 
stable. In the second example the structure makes no difference. 


Example 8.6 RS with diagonal input uncertainty 


Magnitude 


Frequency 


Figure 8.11: Robust stability for diagonal input uncertainty is guaranteed 
since wa,(Tr) < 1/|wr|, Vw. The use of unstructured uncertainty and a(Tr) is 
conservative. 
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Consider robust stability of the feedback system in Figure 8.7 for the case when the 
multiplicative input uncertainty is diagonal. A nominal 2x 2 plant and the controller 
(which represents PI-control of a distillation process using the DV-configuration) is 
given by 
— 1+7s | —0.0015 0 


e SES, 0 0.075 | (8-75) 


1 87.8 14 
LG) Neaeorarer| ee a | 


(time in minutes). The controller results in a nominally stable system with acceptable 
performance. Assume there is multiplicative uncertainty in each manipulated input 
of magnitude fs 
s+0. 

wr(s) = 05ed1 (8.76) 
This implies a relative uncertainty of up to 20% in the low frequency range, which 
increases at high frequencies, reaching a value of 1 (100% uncertainty) at about 
1 rad/min. The increase with frequency allows for various neglected dynamics 
associated with the actuator and valve. The uncertainty may be represented as 
multiplicative input uncertainty as shown in Figure 8.7 where Ar is a diagonal 
compler matrit and the weight is Wr = wrl where wr(s) is a scalar. On rearranging 
the block diagram to match the MA-structure in Figure 8.3 we get M =wrKG(I+ 
KG)"! = wrTy; (recall (8.32)), and the RS-condition u(M) <1 in Theorem 8.6 
yields 


1 61 
RS EF —— Vv Ar= 8.77 
SH <a MH Or |" | oh 


This condition is shown graphically in Figure 8.11 and is seen to be satisfied at 
all frequencies, so the system is robustly stable. Also in Figure 8.11 (Tr) can be 
seen to be larger than 1/|wr(jw)| over a wide frequency range. This shows that the 
system would be unstable for full-block input uncertainty (Ar full). However, full- 
block uncertainty is not reasonable for this plant, and therefore we conclude that the 
use of the singular value is conservative in this case whichdemonstrates the need for 
the structured singular value. 


Exercise 8.14 Consider the same example and check for robust stability with 
full-block multiplicative output uncertainty of the same magnitude. (Solution: RS 
satisfied). 


Example 8.7 RS of spinning satellite. Recall Motivating Example No. 1 from 
Section 3.7.1 with the plant G(s) given in (3.72) and the controller K = I. We want 
to study how sensitive this design is to multiplicative input uncertainty. 

In this case T; = T, so for RS there is no difference between multiplicative input 
and multiplicative output uncertainty. In Figure 8.12, we plot p(T) as a function 
of frequency. We find for this case that u(T) = a(T) irrespective of the structure 
of the complex multiplicative perturbation (full-block, diagonal or repeated complex 
scalar). Since (TL) crosses 1 at about 10 rad/s, we can tolerate more than 100% 
uncertainty at frequencies above 10 rad/s. At low frequencies u(T) is about 10, so to 
guarantee RS we can at most tolerate 10% (complex) uncertainty at low frequencies. 
This confirms the results from Section 3.7.1, where we found that real perturbations 
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Figure 8.12: y-plot for robust stability of spinning satellite 


6, = 0.1 and 69 = —0.1 yield instability. Thus, the use of complex rather than real 
perturbations is not conservative in this case, at least for Ar diagonal. 

However, with repeated scalar perturbations (i.e. the uncertainty in each channel 
is identical) there is a difference between real and complex perturbations. With 
repeated real perturbations available software (using the command m with blk 
= [-2 0] in the p-toolbox in MATLAB) yields a peak p-value of 1, s0 we can 
tolerate a perturbation 61 = 62 of magnitude 1 before getting instability (This is 
confirmed by considering the characteristic polynomial in (3.76), from which we see 
that 61 = 62 = —1 yields instability.) On the other hand, with complex repeated 
perturbations, we have that u(T) = p(T) is 10 at low frequencies, so instability may 
occur with a (non-physical) complex 6, = 52 of magnitude 0.1. (Indeed, from (3.76) 
we see that the non-physical constant perturbation 6, = 62 = j0.1 yields instability.) 


8.9 Properties and computation of 1 


Recall from (8.72) the definition of the structured singular value 


a 


u(M)~! min{km| det (I — km MA) = 0 for some allowed A, a(A) < 1} 


(8.78) 


8.9.1 Remarks on the definition of 4 


1. The structured singular value was introduced by Doyle (1982). At the same 
time (in fact, in the same issue of the same journal) Safonov (1982) introduced 
the Multivariable Stability Margin k, for a diagonally perturbed system as the 
inverse of 4, that is, km(M) = p(M)~'. In many respects, as was illustrated 
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above, this is a more natural definition of a robustness margin. However, as 
already mentioned, (MM) has a number of other advantages, such as providing a 
generalization of the spectral radius, p(M), and the spectral norm, a(M). 

2. The A corresponding to the smallest k,, in (8.78) will always have a(A) = 1, 
since if det(I — ki, MA’) = 0 for some A’ with a(A’) = c < 1, then 1/k,, 
cannot be the structured singular value of M, since there exists a smaller scalar 
km = kine such that det(I — km MA) =0 where A = 1A’ and &(A) = 1. 

3. Note that with k, = 0 we obtain I — ky, MA = I which is clearly nonsingular. 
Thus, one possible way to obtain 4 numerically, is to start with k, = 0 , and 
gradually increase k,, until we first find an allowed A with a(A) = 1 such that 
(I — km MA) is singular (this value of km is then 1/1). By “allowed” we mean 
that A must have the specified block-diagonal structure and that some of the 
blocks may have to be real. 

4. The sequence of M and A in the definition of 4 does not matter. This follows 
from the identity (A.11) which yields 


det (I — km MA) = det(I — kmAM) (8.79) 

5. In most cases M and A are square, but this need not be the case. If they 

are nonsquare, then we make use of (8.79) and work with either MA or AM 
(whichever has the lowest dimension). 


6. Equivalent definition of u No.1. When defining jp we may choose to work directly 
with yu instead of its inverse, and we may replace (8.78) by the following definition 


u(M) S max{ | det(I — 1 MA) = 0 for structured A,a(A) < 1} (8.80) 
M KL 


7. Equivalent definition of 4 No.2. Alternatively, We may also vary the size of the 
perturbations (as was originally done by Doyle (1982)) to give the definition 


p(M)~! = min{a(A)| det(I — MA) = 0 for structured A} (8.81) 


Example 8.8 , ofascalar. If M isa scalar then in most cases u(M) = |M|. This 
follows from (8.81) by selecting |A| = 1/|M| such that (1— MA) = 0. However, this 
requires that we can select the phase of A such that MA is real, which is impossible 
when A is real and M has an imaginary component, so in this case u(M) =0. In 
summary, we have 


A complex : (MM) = |M| (8.82) 


|M| for real M 


nee Cs { 0 otherwise (8:88) 


The above example shows that may depend on whether the perturbation 
is real or complex. The following example demonstrates that ~ also depends 
on the structure of A. 


Example 8.9 Let A be complez and consider again the matrix M, in Example 


8.5. We get 
_—}0 2]. _ J 2, A full matrix 
Mas E | oa { 0, A diagonal 
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Proof: For the case when A is a full complex matrix we have already (in Example 
8.5) constructed a “worst-case” A' with o(A') = 0.5 which makes I-MA' singular, 
and it follows from (8.81) that u(M1) = 2. Next, consider the case when we restrict 
A to be diagonal. We have 


_ fo 0 _ [1 -262km 
a=[% 2], rsnma=[2 2] 


The determinant of this matriz is equal to 1 irrespective of the values of 61, 62 and 
km. So in this case no perturbation yields singularity and u(M1) =0. 


8.9.2 What do 1 41 and skewed-1; mean? 


A value of = 1.1 for robust stability means that all the uncertainty blocks 
must be decreased in magnitude by a factor 1.1 in order to guarantee stability. 

But if we want to keep some of the uncertainty blocks fixed, how large can 
one particular source of uncertainty be before we get instability? We define 
this value as 1/*, where yu* is called skewed-p. We may view u*(M) as a 
generalization of u(M). 

For example, let A = diag{A,, A>} and assume we have fixed ||A;|| < 1 and 
we want to find how large A» can be before we get instability. The solution 
is to select 


Ke E _ | (8.84) 


and look at each frequency for the smallest value of km which makes 
det(I — Km MA) = 0, and we have that skewed-, is 


p?(M) 2 1/km 


Note that to compute skewed-y we must first define which part of the 
perturbations is to be constant. y°(M) is always further from 1 than p(M) 
is, ie. uw? > pp for uw > 1, we = w for w= 1, and p® < pu for uw < 1. In practice, 
with available software to compute pz, we obtain p* by iterating on k» until 
[(Ky,M) = 1 where K,,, may be as in (8.84). This iteration is straightforward 
since yp increases uniformly with k,. 


8.9.3. Properties of » for real and complex 
perturbations 


Two properties of which hold for both real and complex perturbations are: 


1. u(aM) = |al|u(M) for any real scalar a. 
2. Let A = diag{A,, Az} be a block-diagonal perturbation (in which A; and 
As may have additional structure) and let M be partitioned accordingly. 
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Then 
Ba(M) > max{Ha, (Mir), Has (Moz) } (8.85 


Proof: Consider det(I — MA) where «. = fta(M) and use Schur’s formula in 
(A.13) with Ai. = J — aMuAi and Aiz =I — + M22A2. 


In words, (8.85) simply says that robustness with respect to two 
perturbations taken together is at least as bad as for the worst perturbation 
considered alone. This agrees with our intuition that we cannot improve 
robust stability by including another uncertain perturbation. 


In addition, the upper bounds given below for complex perturbations, 
eg. pa(M) < minpepo(DMD~?) in (8.95), also hold for real or mixed 
real/complex perturbations A. This follows because complex perturbations 
include real perturbations as a special case. However, the lower bounds, e.g. 
p(M) > p(M) in (8.90), only holds generally for complex perturbations. 


8.9.4 yw for complex perturbations 


When all the blocks in A are complex, may be computed relatively easily. 
This is discussed below and in more detail in the survey paper by Packard 
and Doyle (1993). Most of this follows from the following result, which may 
be viewed as another definition of j: that applies for complex A only. 


Lemma 8.7 For complex perturbations A with a(A) <1: 


B(M) = maxa,e(a)<i p(MA) (8.86) 


Proof: The lemma follows directly from the definition of ~ and the equivalence 
between (8.43) and (8.46). Alternatively, from Appendix A.2.1 we have 


a 1 
det(I aA) =|[ (: (ima) (8.87) 


a 


where \; denotes the 7’th eigenvalue. Consider a given A’ with a(A’) = 1 and 
p(MA') = r. If we assume that the set of allowed A’s includes complex matrices 
then there always exists in the set another A = cA’ (where c is a complex number 
with |c| = 1) such that \;(MA) = r. That is, we can adjust the phase of the 
largest: eigenvalue of MA’ such that it is real and positive. It then follows for the 
perturbation A’ that selecting 4 = r makes the determinant in (8.87) equal to zero. 
Since the structured singular value is found as the largest possible value of r (recall 
(8.80)), we have proved (8.86). 


Properties of ,, for complex perturbations 


Most of the following properties follow easily from (8.86). 
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1. u(aM) = |al|u(M) for any (complez) scalar a. 
2. For a repeated scalar complex perturbation we have 


A =6I (6 is a complex scalar): j(M) = p(M) (8.88 


Proof: Follows directly from (8.86) since there are no degrees-of-freedom for the 
maximization. 


3. For a full block complex perturbation we have from (8.86) and (8.47): 


A full matrix: p(M) =a(M) (8.89 


4. yw for complex perturbations is bounded by the spectral radius and the 
singular value (spectral norm): 


p(M) < u(M) < o(M) (8.90) 


This follows from (8.88) and (8.89), since selecting A = 6I gives the fewest 
degrees-of-freedom for the optimization in (8.86), whereas selecting A full 
gives the most degrees-of-freedom. 

5. Consider any unitary matrix U with the same structure as A. Then 


(MU) = p(M) = n(UM) (8.91 


Proof: Follows from (8.86) by writing MUA = MA' where a(A’) = a(UA) = 
a(A), and so U may always be absorbed into A. 


6. Consider any matrix D which commutes with A, that is, AD = DA. Then 


u(DM)=p(MD) and p(DMD™~*) = p(M) (8.92 
Proof: u(DM) = u(MD) follows from 


pa(DM) = max p(DM A) = max p( MAD) = max p(M DA) = pa(MD) 
(8.93) 
The first equality is (8.86). The second equality applies since p(AB) = p(BA) (by 
the eigenvalue properties in the Appendix). The key step is the third equality 


which applies only when DA = AD. The fourth equality again follows from 
(8.86). 


7. Improved lower bound. Define U/ as the set of all unitary matrices U 
with the same block-diagonal structure as A. Then 


(MM) = maxvey p(MU) (8.94) 


Proof: The proof of this important result is given by Doyle (1982) and Packard 
and Doyle (1993). It follows from a generalization of the maximum modulus 
theorem for rational functions. 
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10. 


11. 


The result (8.94) is motivated by combining (8.91) and (8.90) to yield 
> 
u(M) > max p(MU) 


The surprise is that this is always an equality. Unfortunately, the 
optimization in (8.94) is not convex and so it may be difficult to use in 
calculating 4 numerically. 


. Improved upper bound. Define D to be the set of matrices D which 


commute with A (i.e. satisfy DA = AD). Then it follows from (8.92) and 
(8.90) that 


u(M) < minnepo(DMD“") (8.95) 


This optimization is convex in D (i.e., has only one minimum, the global 
minimum) and it may be shown (Doyle, 1982) that the inequality is in 
fact an equality if there are 3 or fewer blocks in A. Furthermore, numerical 
evidence suggests that the bound is tight (within a few percent) for 4 blocks 
or more; the worst known example has an upper bound which is about 15% 
larger than pu (Balas et al., 1993). 

Some examples of D’s which commute with A are 


A = oI: D= full matrix (8.96) 
A = full matrix: D=dI (8.97) 
ft Ge Again ee = 1a: saay 6.96) 
A = diag{Ay (full), dol, 63, ba} ey Be diag{diI, Ds (full), d3, d4} (8.99) 


In short, we see that the structures of A and D are “opposites”. 


. Without affecting the optimization in (8.95), we may assume the blocks 


in D to be Hermitian positive definite, ie., D; = DF? > 0, and for scalars 
d; > 0 (Packard and Doyle, 1993). 

One can always simplify the optimization in (8.95) by fixing one of the 
scalar blocks in D equal to 1. For example, let D = diag{d,, do,...,dn}, 
then one may without loss of generality set d, = 1. 


Proof: Let D' = 7D and note that ¢(/DMD~') =a(D'MD'""). 


Similarly, for cases where A has one or more scalar blocks, one may simplify 
the optimization in (8.94) by fixing one of the corresponding unitary scalars 
in U equal to 1. This follows from Property 1 with |c| = 1. 

The following property is useful for finding 4(AB) when A has a structure 
similar to that of A or B: 


HA(AB) < o(A)paa(B) (8.100) 
HA(AB) < o(B)ppa(A) (8.101) 
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12. 


13. 


Here the subscript “AA” denotes the structure of the matrix AA, and 
“BA” denotes the structure of BA. 


Proof: The proof is from (Skogestad and Morari, 1988a). Use the fact that 
(AB) = maxa p(AAB) = maxa p(VB)a(A) where V = AA/a(A). When we 
maximize over A then V generates a certain set of matrices with a(V) < 1. Let 
us extend this set by maximizing over all matrices V with o(V) < 1 and with the 
same structure as AA. We then get (AB) < maxy p(V B)a(A) = pv (B)a(A). 


Some special cases of (8.100): 


(a) If A is a full matrix then the structure of AA is a full matrix, and we 
simply get u(AB) < a(A)a(B) (which is not a very exciting result since 
we always have u(AB) < a(AB) < a(A)a(B). 

(b) If A has the same structure as A (e.g. they are both diagonal) then 


uaA(AB) < a(A)ua(B) (8.102) 


Note: (8.102) is stated incorrectly in Doyle (1982) since it is not specified that 
A must have the same structure as A. 

If A = OI (i.e., A consists of repeated scalars), we get the spectral radius 
inequality p(AB) < a(A)u4(B). A useful special case of this is 


= 
ie) 
ees 


p(MA) < o(A)pa(M) (8.103) 


A generalization of (8.100) and (8.101) is: 


= 2/0 A 
HA(ARB) < a(R) ux E 0 (8.104) 
where A = diag{A,R}. The result is proved by (Skogestad and Morari, 
1988a). 
The following is a further generalization of these bounds. Assume M is an 
LFT of R: M = Ny, + Ny2R(I — No2R)~'No,. The problem is to find an 
upper bound on R, a(R) < c, which guarantees that a(M) < 1 when 
ba( Nii) < 1. Skogestad and Morari (1988a) show that the best upper 
bound is the c which solves 
7 Niu Nig -] 


| @Noe. Nes (8.105) 


where A = diag{A, R}, and c is easily computed using skewed-y. Given 
the y-condition uwa(M/) < 1 (for RS or RP), (8.105) may be used to derive 
a sufficient loop-shaping bound on a transfer function of interest, e.g. R 
may be S$, T, L, L“' or K. 
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Remark. Above we have used minp. To be mathematically correct, we should have 
used infp because the set of allowed D’s is not bounded and therefore the exact 
minimum may not be achieved (although we may be get arbitrarily close). The use 
of max, (rather than sup,) is mathematically correct since the set A is closed (with 
a(A) < 1). 


Example 8.10 Let M and A be compler 2 x 2 matrices. Then 


p(M) =|a+)] for A= 6I 
M= 3 ak u(M) = 2 |a\+|o| for A = diag{é,,62} 
a(M) = V/2\a\/? + 2|b/? for A a full matrix 


(8.106) 


Proof: For A = 61, u(M) = p(M) and p(M) = |a+b| since M is singular and 

its nonzero eigenvalue is 44(M) = tr(M) = at b. For A full, w(M) = a(M) 

and o(M) = 4/2\a|? + 2|b|? since M is singular and its nonzero singular value is 

a(M) = ||M|lr, see (A.124). For a diagonal A, it is interesting to consider these 

different proofs of the result u(M) = |a| + |b: 

1. A direct calculation based on the definition of pu. 

2. Use of the lower “bound” in (8.94) (which is always exact). 

3. Use of the upper bound in (8.95) (which is exact here since we have only two 
blocks). 


We will use approach (a) here and leave (b) and (c) for Exercises 8.15 and 8.16. 


We have 
ny [at oe 61 ai ve eS 
wae[e ]f* g]= [i] ara 


From (8.79) we then get 


a 
b 


det(I — MA) = det(I — AM) =1—[& | ] = 1-8 ~ 


The smallest 6; and 62 which make this matrix singular, i.e. 1 — ad, — b62 = 0, are 
obtained when |6,| = |62| = |6| and the phases of 6, and 52 are adjusted such that 


1 — |Ja| - |6| — |b] - |6] = 0. We get |6| = 1/(|a| + |b|), and from (8.81) we have that 
He = 1/|6| = |a| + (8). 


Example 8.11 Let M be a partitioned matrix with both diagonal blocks equal to 
zero. Then 


0 A p(M) = V/p(AB) for A= 61 
Mt E 0 = /a(A)a(B) for A = diag{A;, Az}, A; full 
Re , a(M) = max{a(A),a(B)} for A a full matrix 
M 


(8.107) 


Proof: From the definition of eigenvalues and Schur’s formula (A.13) we get 
\(M) = M(AB) and p(M) = p(AB) follows. For block-diagonal A, 
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uM) = Va(A)a(B) follows in a similar way using u(M) = maxa p(MA) = 
maxa,,A, p(AA2BA1), and then realizing that we can always select Ai and A2 such 
that p(AAi BAz) = a(A)a(B) (recall (8.47)). ¢(M) = max{a(A),a(B)} follows 
since o(M) = ,/p(M4M) where M" M = diag{ B™ B, A™ A}. 


Exercise 8.15 For M in (8.106) and a diagonal A show that u(M) = |a| + |b 
using the lower “bound” u(M) = maxy p(MU) (which is always exact). Hint: Use 
U = diag{e!?,1} (the blocks in U are unitary scalars, and we may fix one of them 
equal to 1). 


Exercise 8.16 For M in (8.106) and a diagonal A show that u(M) = |a| + |d| 
using the upper bound u(M) < minpo(DMD~') (which is exact in this case since 
D has two “blocks”). 

Solution: Use D = diag{d,1}. Since DMD~' is a singular matrix we have from 
(A.35) that 


a 


1 
70 


“(DMD~) =a | = Val? + |dal? + [b/d + [6 (8.108) 


which we want to minimize with respect to d. The solution is d = /|a|/|b| which 
gives w(M) = \/laP? + 2|ab] + BP = lal +d). 


Exercise 8.17 Let c be a complex scalar. Show that for 


—a . Mir Mi2| _ Mit cMie2 
A=diag{A;, Ao}: pw ce | = 1M Mas (8.109) 
Exercise 8.18 Let a,b,c andd be complex scalars. Show that for 
: ab ad ab ab 
A = diag{é1, 62} : ab | =u |e | = |ab| + |cd| (8.110) 


Does this hold when A is scalar times identity, or when A is full? (Answers: Yes 
and No). 


Exercise 8.19 Assume A and B are square matrices. Show by a counterexample 
that (AB) is not in general equal to (BA). Under what conditions is y(AB) = 
f( BA)? (Hint: Recall (8.92). 


Exercise 8.20 /f (8.102) were true for any structure of A then it would imply 
p(AB) < a(A)p(B). Show by a countererample that this is not true. 


8.10 Robust Performance 


Robust Performance (RP) means that the performance objective is satisfied 
for all possible plants in the uncertainty set, even the the worst-case plant. We 
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showed using Figure 7.20 that for a SISO system with an H. performance 
objective, the RP-condition was identical to the RS-condition for the same 
system with an additional perturbation block. 

This also holds for MIMO systems, and the steps in a derivation to show 
that RP may be viewed as a special case of structured RS are summarized 
in Figure 8.13. Step B is the key step and the reader is advised to study 
this carefully in the treatment below. Note that the block Ap (where capital 
P denotes Performance) is always a full matrix. It is a fictitious uncertainty 
block representing the #1. performance specification. 


8.10.1 Testing RP using pw 


To test for RP, we must first “pull out” the uncertain perturbations and 
rearrange the uncertain system into the NA-form of Figure 8.2. Our RP- 
requirement as given in (8.39) is that the H.. norm of the transfer function 
F = F,,(N, A) remains less than 1 for all allowed perturbations. This may be 
tested exactly by simply computing u(M) as stated in the following theorem. 


Theorem 8.8 Robust Performance. Rearrange the uncertain system into 
the NA-structure of Figure 8.13. Assume nominal stability such that N is 
(internally) stable. Then 


def 


RPS Fljc = FAN, A)lloo <1, WAllloo $1 (8-111) 
co 


LA(N(jw)) <1, Vw (8.112) 


where ys is computed with respect to the structure 


A= E a (8.113) 


and Ap is a full complex perturbation with the same dimensions as FT. 


Below we prove the theorem in two alternative ways, but first a few 
remarks: 


1. Condition (8.112) allows us to test if ||F||.. < 1 for all possible A’s without 
having to test each A individually. Essentially, y is defined such that it directly 
addresses the worst case. x 

2. The condition for RP involves the enlarged perturbation A = diag{A, Ap}. 
Here A, which itself may be a block-diagonal matrix, represents the true 
uncertainty, whereas Ap is a full complex matriz stemming from the Ho. norm 
performance specification. For example, for the nominal system (with A = 0) we 
get from (8.89) that ¢(No2) = pap(No2), and we see that Ap must be a full 


matrix. 
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3. Since A always has structure, the use of the H. norm, ||N|l.0 < 1, is generally 
conservative for robust performance. 
4. From (8.85) we have that 


px(N) > max{ ys (Nir), wap (No2)} (8.114) 
x 7 Sess SYS SS 
RP RS NP 


where as just noted wap (No2) = o(No2). (8.114) implies that RS (wa(Ni1) < 1) 
and NP (a(N22) < 1) are automatically satisfied when RP ((N) < 1) is satisfied. 
However, note that NS (stability of N) is not guaranteed by (8.112) and must 
be tested separately (Beware! It is a common mistake to get a design with 
apparently great RP, but which is not nominally stable and thus is actually 
robustly unstable). 

5. For a generalization of Theorem 8.8 see the main loop theorem of Packard and 
Doyle (1993), see also Zhou et al. (1996). 


Block diagram proof of Theorem 8.8 


In the following, let F = F,,(N, A) denote the perturbed closed-loop system 
for which we want to test RP. The theorem is proved by the equivalence 
between the various block diagrams in Figure 8.13. 

Step A. This is simply the definition of RP; ||F'l|oo. < 1. 

Step B (the key step). Recall first from Theorem 8.4 that stability of 
the MA-structure in Figure 8.3 where A is a full matrix is equivalent to 
|Z ||00 < 1. From this theorem we get that the RP-condition ||F'l|.. < 1 is 
equivalent to RS of the FAp-structure where Ap is a full matrix. 

Step C. Introduce F = F,,(N, A) from Figure 8.2. 

Step D. Collect A and Ap into the block-diagonal matrix A. We then have 
that the original RP-problem is equivalent to RS of the N A-structure which 
from Theorem 8.6 is equivalent to px(N) < 1. 


Algebraic proof of Theorem 8.8 
The definition of 4 gives at each frequency 

pz(N(jw)) < 1 det(I — N(jw)A(jw)) #0, VA, o(Ap(jw)) <1 
By Schur’s formula in (A.13) we have 


I-NyA —Ni2Ap 


det(J — NA) = det NGA. ones = 


det(I — Nii A) - det [I NozAp — Nu A(T NuA) | NizAp] = 
det(I — Ni1A) - det[I — (No2 + Na A(T — NiiA)7'Ni2)AP] = 
det(I = Ni A) < det (I = FAN, A)Ap) 
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RP 
STEP A t 
Il F llo< 4, V A llo< 1 
STEP B t 
V | Ap |lo<1 
is RS, 
VILA llo< 1 
V || Ap |loo< 1 
is RS, 
VIA loo< 1 


STEP D t 


isRS, V|| A lloo<1 


? (RS theorem) 


BR(N) <1, Vw 


Figure 8.13: RP as a special case of structured RS. F = F.,(N, A). 
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Since this expression should not be zero, both terms must be nonzero at each 
frequency, i.e., 


det(I— Ni1A)#0VA © pa(Ni1) <1, Ww (RS) 
and for all A 
det(I — FAp) #0 VWAp © pap(F) <1@4(F) <1, Ww (RP definition) 
Theorem 8.8 is proved by reading the above lines in the opposite direction. Note 


that it is not necessary to test for RS separately as it follows as a special case of the 
RP requirement. 


8.10.2 Summary of -conditions for NP, RS and RP 


Rearrange the uncertain system into the NA-structure of Figure 8.2, where 
the block-diagonal perturbations satisfy ||A||o. <1. Introduce 


F = F,(N,A) = Noo + Nor A(T — Ny A)" Mi. 


and let the performance requirement (RP) be ||F'||.. < 1 for all allowable 
perturbations. Then we have: 


NS © N (internally) stable (8.115) 

NP $ G(No2) = bap <1, Vw, and NS (8.116) 

RS © pwa(Nii) <1, Vw, and NS (8.117) 
~ A 0 

RP & px(N) <1, Ww, A= ie i , and NS (8.118) 


Here A is a block-diagonal matrix (its detailed structure depends on the 
uncertainty we are representing), whereas Ap always is a full complex matrix. 
Note that nominal stability (NS) must be tested separately in all cases. 

Although the structured singular value is not a norm, it is sometimes 
convenient to refer to the peak p-value as the “A-norm”. For a stable rational 
transfer matrix H(s), with an associated block structure A, we therefore define 


A ; 
| 1(s)|la = max wa (H (jw) 
For a nominally stable system we then have 


NP © ||Noalloo <1, RSS ||Nulla<1, RP ||NIlx <1 
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8.10.3. Worst-case performance and skewed-,i 


Assume we have a system for which the peak p-value for RP is 1.1. What 
does this mean? The definition of yz tells us that our RP-requirement would 
be satisfied exactly if we reduced both the performance requirement and the 
uncertainty by a factor of 1.1. So, 4s does not directly give us the worst-case 
performance, i.e. maxa G(F(A)), as one might have been expected. 

To find the worst-case weighted performance for a given uncertainty, one 
needs to keep the magnitude of the perturbations fixed (a(A) < 1), that is, 
we must compute skewed- of N as discussed in Section 8.9.2. We have, in 
this case, 


max, 9(FUWN, A)(jw)) = HN) (8.119) 
To find w*® numerically, we scale the performance part of N by a factor 
kim = 1/5 and iterate on k, until 4 = 1. That is, at each frequency skewed-ju 
is the value y*(N) which solves 


u(KmN)=1, Km= E pal (8.120) 


Note that js underestimates how bad or good the actual worst-case 
performance is. This follows because y*(N) is always further from 1 than 
BN). 

The corresponding worst-case perturbation may be obtained as follows: 
First compute the worst-case performance at each frequency using skewed-p. 
At the frequency where ju*(N) has its peak, we may extract the corresponding 
worst-case perturbation generated by the software, and then find a stable, 
all-pass transfer function that matches this. In the MATLAB p-toolbox, the 
single command weperf combines these three steps: [delwc,mulow,muup] = 
weperf (N,b1k,1);. 


8.11 Application: RP with input uncertainty 


We will now consider in some detail the case of multiplicative input 
uncertainty with performance defined in terms of weighted sensitivity, as 
illustrated in Figure 8.14. The performance requirement is then 


RP & |we(I+GpK) lle <1, VGp (8.121) 


where the set of plants is given by 


Gp=G(E+wrAp, |lArllo <1 (8.122) 
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Here wp(s) and wy(s) are scalar weights, so the performance objective is 
the same for all the outputs, and the uncertainty is the same for all inputs. 
We will mostly assume that A; is diagonal, but we will also consider the case 
when A; is a full matrix. 


Figure 8.14: Robust performance of system with input uncertainty. 


This problem is excellent for illustrating the robustness analysis of uncertain 
multivariable systems. It should be noted, however, that although the problem 
setup in (8.121) and (8.122) is fine for analyzing a given controller, it is less 
suitable for controller synthesis. For example, the problem formulation does 
not penalize directly the outputs from the controller. 

In this section, we will: 


1. Find the interconnection matrix N for this problem. 

2. Consider the SISO case, so that useful connections can be made with results 
from the previous chapter. 

3. Consider a multivariable distillation process for which we have already seen 
from simulations in Chapter 3 that a decoupling controller is sensitive to 
small errors in the input gains. We will find that 4 for RP is indeed much 
larger than 1 for this decoupling controller. 

4. Find some simple bounds on y for this problem and discuss the role of the 
condition number. 

5. Make comparisons with the case where the uncertainty is located at the 
output. 
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8.11.1 Interconnection matrix 


On rearranging the system into the NA-structure, as shown in Figure 8.14, 
we get, as in (8.32), 
= wyrTy | (8.123) 


weSG  wpS 
where T; = KG(I + KG)"1, S = (I+GK)~! and for simplicity we have 
omitted the negative signs in the 1,1 and 1,2 blocks of N, since u(N) = n(UN) 
with unitary U = i ra see (8.91). 
For a given controller K we can now test for NS, NP, RS and RP using 
(8.115)-(8.118). Here A = A; may be a full or diagonal matrix (depending on 
the physical situation). 


8.11.2 RP with input uncertainty for SISO system 
For a SISO system, conditions (8.115)-(8.118) with N as in (8.123) become 


NS << S,SG, KS and Ty are stable (8.124) 
NP © |wpS| <1, Ww (8.125) 
RS © |wrTr| <1, Vw (8.126) 
RP © |wpS|+|wrT7| <1, Vw (8.127) 


The RP-condition (8.127) follows using (8.110); that is 


= wt wrkKS)]— fwrTy wrTy 
p= Ea ae ola wpS 


= |wrTt| a |wpS| (8.128) 


where we have used T; = K.SG. For SISO systems T; = T, and we see that 
(8.127) is identical to (7.59), which was derived in Chapter 7 using a simple 
graphical argument based on the Nyquist plot of L=GK. 

Robust performance optimization, in terms of weighted sensitivity with 
multiplicative uncertainty for a SISO system, thus involves minimizing the 
peak value of ju(N) = |w;T|+|wpS|. This may be solved using DK-iteration 
as outlined later in Section 8.12. A closely related problem, which is easier 
to solve both mathematically and numerically, is to minimize the peak value 
(Hoo norm) of the mixed sensitivity matrix 


oo wes 
Nine Ba (8.129) 
From (A.93) we get that at each frequency p(N) = |wrT|+|wpS| differs from 
a(Nmnix) = VlwrTl? + lwpS/? by at most a factor /2; recall (7.63). Thus, 
minimizing ||Nmix||oo is close to optimizing robust performance in terms of 


BN). 
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8.11.3. Robust Performance for 2 x 2 distillation process 


Consider again the distillation process example from Chapter 3 (Motivating 
Example No. 2) and the corresponding inverse-based controller: 


G(s) 1 87.8 —86.4 0.7 


tact (1082. 1006 | SO a) 
The controller provides a nominally decoupled system with 

L=11, S=elandT=tl (8.131) 
Where 


= egg 1485 eA 


0.7 1 8 0.7 1 
Poe. goes : 1 
14+1 5s+0.7 


We have used ¢ for the nominal sensitivity in each loop to distinguish it from 
the Laplace variable s. Recall from Figure 3.12 that this controller gave an 
excellent nominal response, but that the response with 20% gain uncertainty 
in each input channel was extremely poor. We will now confirm these findings 
by a p-analysis. To this effect we use the following weights for uncertainty and 
performance: 
s+0.2 8/2+0.05 
WS ae URS 


With reference to (7.25) we see the weight w7(s) may approximately represent 
a 20% gain error and a neglected time delay of 0.9 min. |w;(jw)| levels off at 
2 (200% uncertainty) at high frequencies. With reference to (2.73) we see 
that the performance weight wp(s) specifies integral action, a closed-loop 
bandwidth of about 0.05 [rad/min] (which is relatively slow in the presence of 
an allowed time delay of 0.9 min) and a maximum peak for a(S) of M, = 2. 

We now test for NS, NP, NS and RP. Note that A; is a diagonal matrix in 
this example. 


(8.132) 


NS With G and K as given in (8.130) we find that S, SG, KS and Ty; are 
stable, so the system is nominally stable. 


NP With the decoupling controller we have 


G(No2) = G(wpS) = 3+0.7 


“12 +005) 


and we see from the dashed-dot line in Figure 8.15 that the NP-condition 
is easily satisfied: G(wpS) is small at low frequencies (0.05/0.7 = 0.07 
at w = 0) and approaches 1/2 = 0.5 at high frequencies. 
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Table 8.1: MATLAB program for Mu-analysis (generates Figure 8.15). 
%, Uses the Mu toolbox 

GO = [87.8 -86.4; 108.2 -109.6]; 

dyn = nd2sys(1,[75 1]); 

Dyn=daug(dyn,dyn); G=mmult(Dyn,G0); 


j, Inverse-based control. 
h 
dynk=nd2sys([75 1],[1 1.e-5],0.7); 
Dynk=daug(dynk,dynk); Kinv=mmult(Dynk,minv(GO)) ; 
h 
Frequency % Weights. 
h 
< eps Fi . , wp=nd2sys([10 1],[10 1.e-5],0.5); Wp=daug(wp,wp); 
Figure 8.15: y-plots for distillation process with decoupling controller wisnd2sys([1 0.2], [0.5 1]); Wisdaug(wi,wi) 
h 
% Generalized plant P. 
RS Since in this case w,T; = w7T is a scalar times the identity matrix, we h 


: systemnames = ’G Wp Wi’; 
have, independent of the structure of A;, that input varie. > [ydelCs aed mE 


outputvar = ’[Wi; Wp; -G-w]’; 
(w;T;) = |w7t| = |0.2 o8+1 input_to_G = ’[utydel]’; 
MA; WILT) = \Wrt] = WW. (0.55 + 1)(1.43s +1) oe = ae input_toWi = ’[u]’; 
sysoutname = ’P’; 
and we see from the dashed line in Figure 8.15 that RS is easily satisfied. uel Spy Sg ee 
The peak value of a ,(M) over frequency is || M ||,, = 0.53. This means N = starp(P,Kinv); omega = logspace(-3,3,61); 
MA; y, I 
that we may increase the uncertainty by a factor of 1/0.53 = 1.89 before = frep(N, omega) ; 
the worst-case uncertainty yields instability, that is, we can tolerate ma for RP. 
about 38% gain uncertainty and a time delay of about 1.7 min before % 
we get instability. bik = [1 1; 1 15 2 2]; 
[mubnds ,rowd,sens,rowp,rowg] = mu(Nf,blk,’c’); 
RP Although our system has good robustness margins (RS easily satisfied) ri anne aang Tas pevmoem (mane) Renee BG EZE 
and excellent nominal performance we know from the simulations in j, Worst-case weighted sensitivity 
Figure 3.12 that the robust performance is poor. This is confirmed by h : 
the p-curve for RP in Figure 8.15 which was computed numerically using poo cen rae a ; ae isn ee a 
Bx(N) with N as in (8.123), A = diag{Ar, Ap} and Ay = diag {d1, 62}. % mu for RS. 
The peak value is close to 6, meaning that even with 6 times less _ sa anita 
. A ioe a rs=se 51:2,1:2); 
uncertainty, the weighted sensitivity will be about 6 times larger than [mubnde: woud sens + ospcone leaulleoel ae a aes 
what we require. The peak of the actual worst-case weighted sensitivity muRS = sel(mubnds,’:’,1); pkvnorm(muRS) % (ans = 0.5242). 
with uncertainty blocks of magnitude 1, which may be computed using : ehctaed fancs ey oe 
skewed-p, is for comparison 44.93. a Beene See niet tye aR 
h 
Nnp=sel (Nf,3:4,3:4); 
The MATLAB p-toolbox commands to generate Figure 8.15 are given in [mubnds ,rowd, sens ,rowp, rowg]=mu(Nnp,[2 2],’c’); 
table 8.1. muNP = sel(mubnds,’:’,1); pkvnorm(muNP) ¥, (ans = 0.5000). 


vplot(’?liv,m’ ,muRP,muRS,muNP) ; 


In general, 4 with unstructured uncertainty (A; full) is larger than w with 
structured uncertainty (A; diagonal). However, for our particular plant and 
controller in (8.130) it appears from numerical calculations and by use of 
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(8.135) below, that they are the same. Of course, this is not generally true as 
is confirmed in the following exercise. 


Exercise 8.21 Consider the plant G(s) in (8.75) which is ill-conditioned with 
7(G) = 70.8 at all frequencies (but note that the RGA-elements of G are all about 
0.5). With an inverse-based controller K(s) = °*G(s)~', compute p for RP with 
both diagonal and full-block input uncertainty using the weights in (8.132). The value 
of u is much smaller in the former case. 


8.11.4 yz and the condition number 


In this subsection we consider the relationship between ys for RP and the 
condition number of the plant and controller. We consider unstructured 
multiplicative input uncertainty (i.e. A; is a full matrix) and performance 
measured in terms of weighted sensitivity. 

Any controller. Let N be given as in (8.123). Then 


RP RS NP 
—_—_—a ti ts 
ig (N) < (5(wrT) +5 (wpS)\(1 + VE) (8.133) 


where k is either the condition number of the plant or the controller (the 
smallest one should be used): 


k=7(G) or k= 7(K) (8.134) 
Proof of (8.133): Since Ay is a full matrix (8.95) yields 


ae Nu dNi2 
BON rs Ee N22 


where from (A.45) 


2, wrt dwiKS 


a |g ee | a(wrTr [I dGo'])+a(weS [dG ]) 


lA 


a(wiTr)a dGo')+e(weS)a(d'G ) 
——_—" —=———— 


_ 
<14|dla(G-1) <14|d—-1|2(G) 


and selecting d = ,/ een = /7(G) gives 


HN) < [@(wrTr) + F(wpS)] (1 + V¥(G) 


A similar derivation may be performed using SG = K~'Tr to derive the same 
expression but with y(K) instead of 7(G). 
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From (8.133) we see that with a “round” controller, i.e. one with y(A) = 1, 
there is less sensitivity to uncertainty (but it may be difficult to achieve NP 
in this case). On the other hand, we would expect ju for RP to be large if we 
used an inverse-based controller for a plant with a large condition number, 
since then y(.A’) = 7(G) is large. 


Example 8.12 For the distillation process studied above, we have y(G) = 7(K) = 
141.7 at all frequencies, and at frequency w = 1 rad/min the upper bound given by 
(8.133) becomes (0.52 + 0.41)(1 + V141.7) = 13.1. This is higher than the actual 
value of u(.N) which is 5.56, which illustrates that the bound in (8.183) is generally 
not tight. 


Inverse-based controller. With an inverse-based controller (resulting in 
the nominal decoupled system (8.131)) and unstructured input uncertainty, it 
is possible to derive an analytic expression for 4 for RP with N as in (8.123): 


1 
U(X) = mvc? + st? + hort burl (44@) + Ey) (8.135) 
where ¢ is the nominal sensitivity and y(G) is the condition number of the 
plant. We see that for plants with a large condition number, pu for RP increases 
approximately in proportion to \/7(G). 


Proof of (8.135): The proof originates from Stein and Doyle (1991). The upper 
p-bound in (8.95) with D = diag{dI, I} yields 


(iy =. dads wrtl wrt(dG)~! ne wrtl wrt(dd)—! 
B ~ a | wpe(dG) wpel ~T” | wee(dd) wpel 
: = wrt wrt(do;)~! 
= minmaxs 
di wpe(do;) W pe 


= min max J wre? + |wrt|? + |wpedoi|? + |wrt(doi)—1/? 
a 


We have here used the SVD of G = USV™ at each frequency, and have used the fact 
that ¢ is unitary invariant. a; denotes the 7’th singular value of G. The expression 
is minimized by selecting at each frequency d = |w7t|/(|wpe|(G)a(G)), see (8.108), 
and hence the desired result. 


Example 8.13 For the distillation column example studied above, we have at 
frequency w = 1 rad/min, |wpee| = 0.41 and |wrt| = 0.52, and since y(G) = 141.7 
at all frequencies, (8.135) yields u(N) = /0.17 + 0.27 + 30.51 = 5.56 which agrees 
with the plot in Figure 8.15. 


Worst-case performance (any controller) 


We next derive relationships between worst-case performance and the 
condition number. Suppose that at each frequency the worst-case sensitivity 
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is a(S’). We then have that the worst-case weighted sensitivity is equal to 
skewed-p: 

max a(wpS») = o(wpS’) = p*(N) 
Now, recall that in Section 6.10.4 we derived a number of upper bounds on 
a(S"), and referring back to (6.72) we find 


= (9 a(S) 
< ——_———_—_ : 
a(S") < 1G) (8.136) 
A similar bound involving y(K) applies. We then have 
Mh a(wpS) 
7 = 7 < . 
pe (N) = a(wpS") 1— swirl) (8.137) 


where k as before denotes either the condition number of the plant or the 
controller (preferably the smallest). Equation (8.137) holds for any controller 
and for any structure of the uncertainty (including A; unstructured). 


Remark 1 In Section 6.10.4 we derived tighter upper bounds for cases when A; 
is restricted to be diagonal and when we have a decoupling controller. In (6.78) we 
also derived a lower bound in terms of the RGA. 


Remark 2 Since u* = w when py = 1, we may, from (8.133), (8.137) and expressions 
similar to (6.76) and (6.77), derive the following sufficient (conservative) tests for 
RP (u(N) < 1) with unstructured input uncertainty (any controller): 


RP < [a(wpS)+4(wrTr)] + Vk) <1, Vw 


RP <= ka(wpS)+a(wirTr) <1, Vw 
RP < G(wpS)+ka(wrT) <1, Vw 


where k denotes the condition number of the plant or the controller (the smallest 
closely being the most useful). 


Example 8.14 For the distillation process the upper bound given by (8.137) at 
w = 1 rad/min is 141.7 - 0.41/(1 — 0.52) = 121. This is higher than the actual 
peak value of u* = maxs, G(wpSp) which as found earlier is 44.9 (at frequency 1.2 
rad/min), and demonstrates that these bounds are not generally tight. 


8.11.5 Comparison with output uncertainty 


Consider output multiplicative uncertainty of magnitude wo (jw). In this case 
we get the interconnection matrix 


(8.138) 


N- Re ner 


wpS wes 
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and for any structure of the uncertainty (NV) is bounded as follows: 


RP eet 
= wot f ~ = wol 
ee <j) < v3a [08] (8.139) 
—— 
NP 


This follows since the uncertainty and performance blocks both enter at the 
output (see Section 8.6.3) and from (A.44) the difference between bounding 
the combined perturbations, ¢[ Ao Ap] and individual perturbations, 
a(Ao) and a(Ap), is at most a factor of /2. Thus, in this case we 
“automatically” achieve RP (at least within V2) if we have satisfied separately 
the subobjectives of NP and RS. This confirms our findings from Section 
6.10.4 that multiplicative output uncertainty poses no particular problem 
for performance. It also implies that for practical purposes we may optimize 
robust performance with output uncertainty by minimizing the H,. norm of 


the stacked matrix [eee 
wps 


Exercise 8.22 Consider the RP-problem with weighted sensitivity and multiplica- 
tive output uncertainty. Derive the interconnection matrix N for, 1) the conventional 
case with A = diag{A, Ap}, and 2) the stacked case when A= [A Ap]. Use this 
to prove (8.139). 


8.12 y-synthesis and DK-iteration 


The structured singular value yz is a very powerful tool for the analysis of 
robust performance with a given controller. However, one may also seek p- 
synthesis to find the controller that minimizes a certain p-condition: 


8.12.1 DkK-iteration 


At present there is no direct method to synthesize a p-optimal controller. 
However, for complex perturbations a method known as DK-iteration is 
available. It combines H.-synthesis and p-analysis, and often yields good 
results. The starting point is the upper bound (8.95) on yw in terms of the 
scaled singular value 
p(N) < min o(DND“!) 
DED 


The idea is to find the controller that minimizes the peak value over frequency 
of this upper bound, namely 


, e AT, 
min(mnin || DN (K)D~ oo) (8.140) 
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by alternating between minimizing ||DN(K)D~+||.. with respect to either K 
or D (while holding the other fixed). To start the iterations, one selects an 
initial stable rational transfer matrix D(s) with appropriate structure. The 
identity matrix is often a good initial choice for D provided the system has 
been reasonably scaled for performance. The DK-iteration then proceeds as 
follows: 


1. K-step. Synthesize an H.. controller for the scaled problem, 
minx || DN(K)D—'||,. with fixed D(s). 

2. D-step. Find D(jw) to minimize at each frequency ¢(DND~1(jw)) with 
fixed NV. 

3. Fit the magnitude of each element of D(jw) to a stable and minimum phase 
transfer function D(s) and go to Step 1. 


The iteration may continue until satisfactory performance is achieved, 
||DND~" loo <1, or until the Ho. norm no longer decreases. One fundamental 
problem with this approach is that although each of the minimization steps 
(K-step and D-step) are convex, joint convexity is not guaranteed. Therefore, 
the iterations may converge to a local optimum. However, practical experience 
suggests that the method works well in most cases. 

The order of the controller resulting from each iteration is equal to the 
number of states in the plant G(s) plus the number of states in the weights 
plus twice the number of states in D(s). For most cases, the true p-optimal 
controller is not rational, and will thus be of infinite order, but because we 
use a finite-order D(s) to approximate the D-scales, we get a controller of 
finite (but often high) order. The true p-optimal controller would have a flat 
p-curve (as a function of frequency), except at infinite frequency where p 
generally has to approach a fixed value independent of the controller (because 
L(joo) = 0 for real systems). However, with a finite-order controller we will 
generally not be able (and it may not be desirable) to extend the flatness to 
infinite frequencies. 

The DkK-iteration depends heavily on optimal solutions for Steps 1 and 
2, and also on good fits in Step 3, preferably by a transfer function of low 
order. One reason for preferring a low-order fit is that this reduces the order 
of the H. problem, which usually improves the numerical properties of the 
Ho. optimization (Step 1) and also yields a controller of lower order. In some 
cases the iterations converge slowly, and it may be difficult to judge whether 
the iterations are converging or not. One may even experience the p-value 
increasing. This maybe caused by numerical problems or inaccuracy (e.g. the 
upper bound p-value in Step 2 being higher than the 7. norm obtained in 
Step 1), or by a poor fit of the D-scale. In any case, if the iterations converge 
slowly, then one may consider going back to the initial problem and rescaling 
the inputs and outputs. 
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In the K-step (Step 1) where the H. controller is synthesized, it is often 
desirable to use a slightly sub-optimal controller (e.g. y is 5% higher than 
Ymin). This yields a blend of H and Hy optimality with a controller which 
usually has a steeper high-frequency roll-off than the H.. optimal controller. 


8.12.2 Adjusting the performance weight 


Recall that if js at a given frequency is different from 1, then the interpretation 
is that at this frequency we can tolerate 1/y-times more uncertainty and still 
satisfy our performance objective with a margin of 1/. In p-synthesis, the 
designer will adjust some parameter in the performance or uncertainty weights 
until the peak p-value is close to 1. Sometimes the uncertainty is fixed, and 
we effectively optimize worst-case performance by adjusting a parameter in 
the performance weight. For example, consider the performance weight 


8/M +; 
20 rman (8.141) 
B 


where we want to keep M constant and find the highest achievable bandwidth 
frequency w%. The optimization problem becomes 


max|w}| such that p(N) < 1,Vw (8.142) 


where N, the interconnection matrix for the RP-problem, depends on wy. 
This may be implemented as an outer loop around the DK-iteration. 


8.12.3. Fixed structure controller 


Sometimes it is desirable to find a low-order controller with a given structure. 
This may be achieved by numerical optimization where pz is minimized with 
respect to the controller parameters (sometimes denoted “jK-iteration” ). 
The problem here is that the optimization is not generally convex in the 
parameters. Sometimes it helps to switch the optimization between minimizing 
the peak of pu (ie., |||.) and minimizing the integral square deviation of yu 
away from k (ie., ||u(jw) — &ll2) where & usually is close to 1. The latter is 
an attempt to “flatten out” wp. 


8.12.4 Example: p-synthesis with DK-iteration 


We will consider again the case of multiplicative input uncertainty and 
performance defined in terms of weighted sensitivity, as discussed in detail 
in Section 8.11. We noted there that this set-up is fine for analysis, but less 
suitable for controller synthesis, as it does not explicitly penalize the outputs 
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from the controller. Nevertheless we will use it here as an example of p- 
synthesis because of its simplicity. The resulting controller will have very large 
gains at high frequencies and should not be used directly for implementation. 
In practice, one can add extra roll-off to the controller (which should work 
well because the system should be robust with respect to uncertain high- 
frequency dynamics), or one may consider a more complicated problem set-up 
(see Section 12.4). 

With this caution in mind, we proceed with the problem description. Again, 
we use the model of the simplified distillation process 


1 87.8 —86.4 
hea aT) Ee anal eae) 
The uncertainty weight w7I and performance weight wp/ are given in (8.132), 
and are shown graphically in Figure 8.16. The objective_is to minimize the 
peak value of ux(N), where N is given in (8.123) and A = diag{A;, Ap}. 
We will consider diagonal input uncertainty (which is always present in 
any real problem), so A; is a 2 x 2 diagonal matrix. Ap is a full 2 x 2 
matrix representing the performance specification. Note that we have only 
three complex uncertainty blocks, so u(N) is equal to the upper bound 
minp ¢(DND~*) in this case. 

We will now use DK-iteration in attempt to obtain the u-optimal controller 
for this example. The appropriate commands for the MATLAB p-toolbox are 


listed in table 8.2. 


10° 


e 
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Figure 8.16: Uncertainty and performance weights. Notice that there is a frequency 
range where both weights are less than one in magnitude. 


integrator 

First the generalized plant P as given in (8.29) is constructed. It includes 
the plant model, the uncertainty weight and the performance weight, but not 
the controller which is to be designed (note that N = Fi(P,K)). Then the 
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Table 8.2: MATLAB program to perform Dk-iteration. 
% Uses the Mu toolbox 


GO = [87.8 -86.4; 108.2 -109.6]; y%, Distillation 
dyn = nd2sys(1,[75 1]); Dyn = daug(dyn,dyn) ; % process. 

G = mmult (Dyn,GO); 

h 

% Weights. 

h 

wp = nd2sys([10 1],[10 1.e-5],0.5); % Approximated 
wi = nd2sys([1 0.2],[0.5 1]); % integrator. 
Wp = daug(wp,wp); Wi = daug(wi,wi); 

h 

% Generalized plant P. % 

systemnames = ’G Wp Wi’; 


inputvar = ’[ydel(2); w(2) ; u(2)]’; 


outputvar = ’[Wi; Wp; -G-w]’; 

input_to_G = ’[utydel]’; 

input_toWp = ’[G+w]’; input_toWi = ’[u]’; 
sysoutname = ’P’; cleanupsysic = ’yes’; 
sysic; 

h 

¥, Initialize. 

h 


omega = logspace(-3,3,61); 

blk = [1 1; 1 1; 2 2]; 

nmeas=2; nu=2; gmin=0.9; gamma=2; tol=0.01; dO = 1; 
dsysl = daug(d0,d0,eye(2),eye(2)); dsysr=dsys1; 


% START ITERATION. 


% STEP 1: Find H-infinity optimal controller 
% with given scalings: 


DPD = mmult(dsys1,P,minv(dsysr)); gmax=1.05*gamma; 
[K,Nsc,gamma] = hinfsyn(DPD,nmeas,nu,gmin, gmax,tol); 


Nf=frsp(Nsc, omega) ; y, (Remark: 

vA % Without scaling: 
4, % Nestarp(P,Kk);). 

% STEP 2: Compute mu using upper bound: 

h 

[mubnds ,rowd,sens,rowp,rowg] = mu(Nf,blk,’c’); 

vplot(’?liv,m’,mubnds); murp=pkvnorm(mubnds, inf) 

h 

¥, STEP 3: Fit resulting D-scales: 

h 

[dsysl,dsysr]=musynflp(dsysl,rowd,sens,blk,nmeas,nu) ; %, choose 4th order. 
h 

i, New Version: 

% [dsysL,dsysR]=msf (Nf ,mubnds, rowd,sens, blk) ; % order: 4, 4, 0. 

% dsysl=daug(dsysL,eye(2)); dsysr=daug(dsysR, eye(2)); 

h 

% GOTO STEP 1 (unless satisfied with murp). 

h 
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block-structure is defined; it consists of two 1 x 1 blocks to represent A; and 
a 2x 2 block to represent Ap. The scaling matrix D for DN D7 then has 
the structure D = diag{d1, d2,d3I2} where Iz is a 2 x 2 identity matrix, and 
we may set dj = 1. As initial scalings we select d? = d§ = 1. P is then 
scaled with the matrix diag{D, Iz} where In is associated with the inputs and 
outputs from the controller (we do not want to scale the controller). 

Iteration No. 1. Step 1: With the initial scalings, D° = I, the Ho software 
produced a 6 state controller (2 states from the plant model and 2 from each 
of the weights) with an Ho norm of y = 1.1823. Step 2: The upper u-bound 
gave the pu-curve shown as curve “Iter. 1” in Figure 8.17, corresponding to a 
peak value of =1.1818. Step 3: The frequency-dependent d,(w) and dy(w) 
from Step 2 were each fitted using a 4th order transfer function. d,(w) and 
the fitted 4th-order transfer function (dotted line) are shown in Figure 8.18 
and labelled “Iter. 1”. The fit is very good so it is hard to distinguish the two 
curves. dy is not shown because it was found that d, ~ dy (indicating that 
the worst-case full-block A; is in fact diagonal). 

Iteration No. 2. Step 1: With the 8 state scaling D1(s) the H.. software gave 
a 22 state controller and ||D1N(D1)~"||.o = 1.0238. Step 2: This controller 
gave a peak value of yu of 1.0238. Step 3: The resulting scalings D? were only 
slightly changed from the previous iteration as can be seen from d?(w) labelled 
“Tter. 2” in Figure 8.18. 

Iteration No. 3. Step 1: With the scalings D?(s) the H.. norm was only 
slightly reduced from 1.024 to 1.019. Since the improvement was very small 
and since the value was very close to the desired value of 1, it was decided to 
stop the iterations. The resulting controller was reduced from 22 states to 21 
states using balanced realization (sysbal1), and this controller (denoted K3 in 
the following) gave a peak p-value of 1.019. 
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Figure 8.17: Change in » during DK-iteration 
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Figure 8.18: Change in D-scale d; during D K-iteration 
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Figure 8.19: y-plots with p-optimal controller Ks 


Analysis of p-optimal controller K3 


The final p-curves for NP, RS and RP with controller K3 are shown in 
Figure 8.19. The objectives of RS and NP are easily satisfied. Furthermore, 
the peak p-value of 1.019 with controller K3 is only slightly above 1, so the 
performance specification ¢(wpS,) < 1 is almost satisfied for all possible 
plants. To confirm this we considered the nominal plant and six perturbed 
plants 


Gi(s) = G(s)En(s) 


where E;; = I + wyAy, is a diagonal transfer function matrix representing 
input uncertainty (with nominal F7) = J). Recall that the uncertainty weight 
is 
(s) s+0.2 
wyr(s) = —>—— 
‘ 0.55+1 
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Figure 8.20: Perturbed sensitivity functions a(S") using y-optimal controller K3. 
Dotted lines: Plants G',,i = 1,6. Lower solid line: Nominal plant G. Upper solid line: 
Worst-case plant G't,,. 
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Figure 8.21: Setpoint response for p-optimal controller K3. Solid line: nominal 
plant. Dashed line: uncertain plant G5 


which is 0.2 in magnitude at low frequencies. Thus, the following input gain 
perturbations are allowable 


12 0 0.8 0 250 0.8 0 
as. | 0 ty) Be = 0 19): B= 0 og|* Bx 0 na 


These perturbations do not make use of the fact that w7(s) increases with 
frequency. Two allowed dynamic perturbations for the diagonal elements in 
wyAy are 


(s) —s+0.2 (s) s+0.2 
€ = —_, « = —-——_ 
‘ 05s+1’ ” 0.5541 
corresponding to elements in F;, of 
—0.4175+1 —0.633s + 1 
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so let us also consider 


men [8 hese [8 


The maximum singular value of the sensitivity, ¢(S/), is shown in Figure 8.20 
for the nominal and six perturbed plants, and is seen to be almost below 
the bound 1/|w;(jw) for all seven cases (i = 0,6) illustrating that RP is 
almost satisfied. The sensitivity for the nominal plant is shown by the lower 
solid line, and the others with dotted lines. At low frequencies the worst-case 
corresponds closely to a plant with gains 1.2 and 0.8, such as G4, G4 or G%. 
Overall, the worst-case of these six plants seems to be G4 = GEj6, which has 
a(S’) close to the bound at low frequencies, and has a peak of about 2.02 
(above the allowed bound of 2) at 1.6 rad/min. 

To find the “true” worst-case performance and plant we used the MATLAB 
p-tools command weperf as explained in Section 8.10.3 on page 347. 
This gives a worst-case performance of maxsg, ||wpSp|loo = 1.037, and the 
sensitivity function for the corresponding worst-case plant G',,.(s) = G(s)(I+ 
wr(s)Awe(s)) found with the software is shown by the upper solid line in 
Figure 8.20. It has a peak value of a(S") of about 2.05 at 0.79 rad/min. 


Remark. The “worst-case” plant is not unique, and there are many plants which 
yield a worst-case performance of maxs,, ||wpSp||o0 = 1.037. In particular, we may 
most likely find plants which are more consistently “worse” at all frequencies, that 
the one shown by the upper solid line in Figure 8.20. 


The time response of y, and yo to a filtered setpoint change in y,, 
r, = 1/(5s+ 1), is shown in Figure 8.21 both for the nominal case (solid line) 
and for 20% input gain uncertainty (dashed line) using the plant G = GF3 
(which we know is one of the worst plants). The response is interactive, but 
shows no strong sensitivity to the uncertainty. The response with uncertainty 
is seen to be much better than that with the inverse-based controller studied 
earlier and shown in Figure 3.12. 


Remarks on the pi-synthesis example. 


1. By trial and error and many long nights Petter Lundstrom was able to reduce the 
peak p-value for robust performance for this problem down to about flopt = 0.974 
(Lundstrém, 1994). The resulting design produced the curves labelled “optimal” 
in Figures 8.17 and 8.18. The corresponding controller, Kop:, may be synthesized 
using 7.0-synthesis with the following third-order D-scales 


(0.001s + 1)(s + 0.25)(s + 0.054) 

((s + 0.67)? + 0.567) (s + 0.013) 

2. Note that the optimal controller Kop: for this problem has an SVD-form. That 
is, let G = UNV", then Kopp = V" KaV where Kg is a diagonal matrix. This 
arises because in this example case U and V are constant matrices. For more 
details see Hovd (1992) and Hovd et al. (1994). 


di(s) = do(s) =2 da =1 (8.144) 


ROBUST STABILITY AND PERFORMANCE 365 


3. For this particular plant it appears that the worst-case full-block input 
uncertainty is a diagonal perturbation, so we might as well have used a full 
matrix for A;. But this does not hold in general. 

4. The Hoo software may encounter numerical problems if P(s) has poles on the jw- 
axis. This is the reason why in the MATLAB code we have moved the integrators 
(in the performance weights) slightly into the left-half plane. 

5. The initial choice of scaling D = I gave a good design for this plant with an Hoo 
norm of about 1.18. This scaling worked well because the inputs and outputs 
had been scaled to be of unit magnitude. For a comparison, consider the original 
model in Skogestad et al. (1988) which was in terms of unscaled physical variables. 


Gunscalea($) (8.145) 


eee | 0.878 —0.864 
~ 75s+1 11.082 —1.096 


(8.145) has all its elements 100 times smaller than in the scaled model (8.143). 
Therefore, using this model should give the same optimal p-value but with 
controller gains 100 times larger. However, starting the DK-iteration with D = I 
works very poorly in this case. The first iteration yields an H.. norm of 14.9 (Step 
1) resulting in a peak p-value of 5.2 (Step 2). Subsequent iterations yield with 
3rd and 4th order fits of the D-scales the following peak p-values: 2.92, 2.22, 
1.87, 1.67, 1.58, 1.53, 1.49, 1.46, 1.44, 1.42. At this point (after 11 iterations) 
the y-plot is fairly flat up to 10 [rad/min] and one may be tempted to stop the 
iterations. However, we are still far away from the optimal value which we know 
is less than 1. This demonstrates the importance of good initial D-scales, which 
is related to scaling the plant model properly. 


Exercise 8.23 Explain why the optimal p-value would be the same if we in the 
model (8.143) changed the time constant of 75 [min] to another value. (But the 
p-iteration itself would be affected.) 


8.13 Further remarks on 


8.13.1 Further justification for the upper bound on 1 


For complex perturbations, the scaled singular value ¢(DND~") is a tight 
upper bound on p(N) in most cases, and minimizing the upper bound 
||DND-*||,.. forms the basis for the DK-iteration. However, ||DND-"||.. 
is also of interest in its own right. The reason for this, is that when all 
uncertainty blocks are full and complex the upper bound provides a necessary 
and sufficient condition for robustness to arbitrary-slow time-varying linear 
uncertainty (Poolla and Tikku, 1995). On the other hand, the use of j assumes 
the uncertain perturbations to be time-invariant. In some cases, it can be 
argued that slowly time-varying uncertainty is more useful than constant 
perturbations, and therefore that it is better to minimize ||DND~1||.. instead 
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of (NV). In addition, by considering how D(w) varies with frequency, one can 
find bounds on the allowed time variations in the perturbations. 

Another interesting fact is that the use of constant D-scales (D is not 
allowed to vary with frequency), provides a necessary and sufficient condition 
for robustness to arbitrary-fast time-varying linear uncertainty (Shamma, 
1994). It may be argued that such perturbations are unlikely in a practical 
situation. Nevertheless, we see that if we can get an acceptable controller 
design using constant D-scales, then we know that this controller will work 
very well even for rapid changes in the plant model. Another advantage of 
constant D-scales is that the computation of yz is then straightforward and 
may be solved using LMIs, see (8.151) below. 


8.13.2 Real perturbations and the mixed problem 


We have not discussed in any detail the analysis and design problems which 
arise with real or, more importantly, mixed real and complex perturbations. 

The current algorithms, implemented in the MATLAB p-toolbox, employ 
a generalization of the upper bound a(DMD-~—"), where in addition to D 
matrices, which exploit the block-diagonal structure of the perturbations, 
there are G matrices, which exploit the structure of the real perturbations. The 
G-matrices have real diagonal elements where A is real and zeros elsewhere. 
The algorithm in the p-toolbox makes use of the following result from Young, 
Newlin and Doyle (1992): If there exists a G > 0, a D and aG with the allowed 
block-diagonal structure such that 


; (« +6)-1(GDMD~ ~ jG) +@y4) <1 (8.146) 


then p(M/) < (. For more details, the reader is referred to Young (1993). 

There is also a corresponding DGK-iteration procedure for synthesis 
(Young, 1994). The practical implementation of this algorithm is however 
difficult, and a very high order fit may be required for the G-scales. An 
alternative approach which involves solving a series of scaled D K-iterations 
is given by (Toffner-Clausen, Andersen, Stoustrup and Niemann, 1995). 


8.13.3. Computational complexity 


Tt has been established that the computational complexity of computing pu 
has a combinatoric (non-polynomial, “NP-hard”) growth with the number 
of parameters involved even for purely complex perturbations (Toker and 
Ozbay, 1995). 

This does not mean, however, that practical algorithms are not possible, 
and we have described practical algorithms for computing upper bounds of 
for cases with complex, real or mixed real/complex perturbations. 
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As mentioned, the upper bound ¢(DMD~*) for complex perturbations is 
generally tight, whereas the present upper bounds for mixed perturbations 
(see (8.146)) may be arbitrarily conservative. 

There also exists a number of lower bounds for computing pu. Most of these 
involve generating a perturbation which makes J — M A singular. 


8.13.4 Discrete case 


It is also possible to use ys for analyzing robust performance of discrete-time 
systems (Packard and Doyle, 1993). Consider a discrete time system 


Troi = AT, + Buz, YR= Carp + Duy 


The corresponding discrete transfer function matrix from u to y is N(z) = 
C(zI—A)~!B+D. First, note that the H norm of a discrete transfer function 


is 
IN loo = max o(C(zI — A)~'B + D) 
|2|21 
This follows since evaluation on the jw-axis in the continuous case is equivalent 
to the unit circle (|z| = 1) in the discrete case. Second, note that N(z) may 
be written as an LFT in terms of 1/z, 


(8.147) 


N(z) = C(I ~ A)" B+ D = F,(H, =D; #=|¢ | 


C D 


Thus, by introducing 6, = 1/z and A, = 6,I we have from the main loop 
theorem (which generalizes Theorem 8.8) that ||N ||. < 1 (NP) if and only if 


uz(H) <1, A=diag{A.,Ap} (8.148) 


where A, is a matrix of repeated complex scalars, representing the discrete 
“frequencies”, and Ap is a full complex matrix, representing the singular 
value performance specification. Thus, we see that the search over frequencies 
in the frequency domain is avoided, but at the expense of a complicated p- 
calculation. The condition in (8.148) is also referred to as the state-space 
test. 

Condition (8.148) only considers nominal performance (NP). However, note 
that in this case nominal stability (NS) follows as a special case (and thus does 
not need to be tested separately), since when wz(H) <1 (NP) we have from 
(8.85) that wa.(A) = p(A) < 1, which is the well-known stability condition 
for discrete systems. 

We can also generalize the treatment to consider RS and RP. In particular, 
since the state-space matrices are contained explicitly in H in (8.147), it 
follows that the discrete time formulation is convenient if we want to consider 
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parametric uncertainty in the state-space matrices. This is discussed by 
Packard and Doyle (1993). However, this results in real perturbations, and 
the resulting u-problem which involves repeated complex perturbations (from 
the evaluation of z on the unit circle), a full-block complex perturbation (from 
the performance specification), and real perturbations (from the uncertainty), 
is difficult to solve numerically both for analysis and in particular for synthesis. 
For this reason the discrete-time formulation is little used in practical 
applications. 


8.13.5 Relationship to linear matrix inequalities (LMIs) 


An example of an LMI problem is the following. Does there exist an X = X¥ 
(where X may have additional block-diagonal structure) such that 


PEXP—-QUXQ+XR+R™7X4+5<0 (8.149) 


where P,Q, Rand S are matrices. Depending on the particular problem, the < 

may be replaced by <. These inequality conditions produce a set of solutions 

which are convex, which make LMIs attractive computationally. Sometimes, 

the matrices P,Q, R and S are functions of a real parameter 3, and we want 

to know, for example, what is the largest 3 for which there is no solution. 
The upper bound for py can be rewritten as an LMI: 


o(DMD-") <B&p(D"'M™DDMD"!) <#? (8.150) 
© DME DDMD"~— 81 <086 M"D?M—~D? <0 (8.151) 


which is an LMI in X = D? > 0. To compute the upper bound for yp based on 
this LMI we need to iterate on 3. It has been shown that a number of other 
problems, including 72 and H. optimal control problems, can be reduced to 
solving LMIs. The reader is referred to Boyd, Ghaoui, Feron and Balakrishnan 
(1994) for more details. 


8.14 Conclusion 


In the two preceding chapters we have discussed how to represent uncertainty 
and how to analyze its effect on stability (RS) and performance (RP) using 
the structured singular value as our main tool. 

To analyze robust stability (RS) of the uncertain system we made use of 
the MA-structure (Figure 8.3) where M represents the transfer function for 
the “new” feedback part generated by the uncertainty. From the small gain 
theorem 

RS <— «a(M)<1 Ww (8.152) 
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This condition is tight (necessary and sufficient) for the special case where at 
each frequency any complex A satisfying (A) < 1 is allowed. More generally, 
the tight condition is 

RS © w(M)<1 Ww (8.153) 


where j1(M/) is the structured singular value (M7). The calculation of js makes 
use of the fact that A has a given block-diagonal structure, where also certain 
blocks may be real (e.g. to handle parametric uncertainty). 

We defined robust performance (RP) as ||Fi(N,A)|loo < 1 for all allowed 
A’s. Since we used the #.. norm in both the representation of uncertainty 
and definition of performance, we found that RP could be viewed as a special 
case of RS, and we derived 


RP © p(N)<1 Ww (8.154) 


where yp is computed with respect to the block-diagonal structure 
diag{A, Ap}. Here A represent the uncertainty and Ap is a ficticious full 
uncertainty block representing the H.. performance bound. 

It should be noted that there are two main approaches to get a robust 
design: 


1. Make the system robust to some “general” class of uncertainty which we do 
not explicitly model. For SISO systems the classical gain and phase margins 
and the peaks of S and T provide useful general robustness measures. 
For MIMO systems the normalized coprime factor uncertainty provides a 
good general class of uncertainty, and the associated Glover-McFarlane Ho 
loop shaping design procedure, see Chapter 10, has proved itself useful in 
applications. 

2. The second approach has been the focus of the preceding two chapters. It 
involves explicitly modelling and quantifying the uncertainty for a given 
plant. Potentially, it yields better designs, but it may require a much 
larger effort in terms of uncertainty modelling, especially if parametric 
uncertainty is considered. Analysis and, in particular, synthesis using p is 
also more involved. 


In applications, it is therefore recommended to start with the first approach, 
and switch to the second approach if the performance is not satisfactory, and 
the uncertainty has structure such that the use of yz is less conservative. 


Practical p-analysis 


We end the chapter by providing a few recommendations on how to use the 
structured singular value ps in practice. 


1. Because of the effort involved in deriving detailed uncertainty descriptions, 
and the subsequent complexity in synthesizing controllers, the rule is 
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to “start simple” with a crude uncertainty description, and then to 
see whether the performance specifications can be met. Only if they 
don’t, should one consider more detailed uncertainty descriptions such as 
parametric uncertainty (with real perturbations). 

2. The use of y implies a worst-case analysis, so one should be careful 
about including too many sources of uncertainty, noise and disturbances 
— otherwise it becomes very unlikely for the worst case to occur, and the 
resulting analysis and design may be unnecessary conservative. 

3. There is always uncertainty with respect to the inputs and outputs, so it 
is generally “safe” to include diagonal input and output uncertainty. The 
relative (multiplicative) form is usually very convenient in this case, where 
the weight starts from say 0.1 and crosses 1 at a frequency where you have 
neglected dynamics in the input or output in question. 

4. is most commonly used for analysis. If ys is used for synthesis, then 
it is usually recommended to keep the uncertainty fixed and adjust the 
parameters in the performance weight until py is close to 1. 


9 


CONTROL SYSTEM DESIGN 


In this chapter, we present practical procedures for multivariable controller design 
which are relatively straightforward to apply and which, in our opinion, have an 
important role to play in industrial control. 

For industrial systems which are either SISO or loosely coupled, the classical 
loop-shaping approach to control system design as described in Section 2.6 has been 
successfully applied. But for truly multivariable systems it has only been in the last 
decade, or so, that reliable generalizations of this classical approach have emerged. 


9.1 Trade-offs in MIMO feedback design 


The shaping of multivariable transfer functions is based on the idea that a 
satisfactory definition of gain (range of gain) for a matrix transfer function 
is given by the singular values of the transfer function. By multivariable 
transfer function shaping, therefore, we mean the shaping of singular values 
of appropriately specified transfer functions such as the loop transfer function 
or possibly one or more closed-loop transfer functions. This methodology 
for controller design is central to the practical procedures described in this 
chapter. 

In February 1981, the IEEE Transactions on Automatic Control published 
a Special Issue on Linear Multivariable Control Systems, the first six papers 
of which were on the use of singular values in the analysis and design of 
multivariable feedback systems. The paper by Doyle and Stein (1981) was 
particularly influential: it was primarily concerned with the fundamental 
question of how to achieve the benefits of feedback in the presence of 
unstructured uncertainty, and through the use of singular values it showed 
how the classical loop-shaping ideas of feedback design could be generalized 
to multivariable systems. To see how this was done, consider the one degree- 
of-freedom configuration shown in Figure 9.1. The plant G and controller K 
interconnection is driven by reference commands r, output disturbances d, 
and measurement noise n. y are the outputs to be controlled, and u are the 


372 MULTIVARIABLE FEEDBACK CONTROL 


control signals. In terms of the sensitivity function S$ = (I + GK)~! and 
the closed-loop transfer function T = GK (I +GK)~! = I — S, we have the 
following important relationships: 


Figure 9.1: One degree-of-freedom feedback configuration 


y(s) = T(s)r(s) + S(s)d(s) — T(s)n(s) (9-1) 
u(s) = K(s)S(s) [r(s) — n(s) — d(s)] 


These relationships determine several closed-loop objectives, in addition to 
the requirement that K stabilizes G; namely: 


1. For disturbance rejection make a(S) small. 

2. For noise attenuation make a(T) small. 

3. For reference tracking make a(T) = o(T) = 1. 
4. For control energy reduction make o(.K.S) small. 


If the unstructured uncertainty in the plant model G is represented by an 
additive perturbation, i.e. G, = G+ A, then a further closed-loop objective 
is 


5. For robust stability in the presence of an additive perturbation make (KS) 
small. 


Alternatively, if the uncertainty is modelled by a multiplicative output 
perturbation such that G, = (I + A)G, then we have: 


6. For robust stability in the presence of a multiplicative output perturbation 
make o(T) small. 


The closed-loop requirements 1 to 6 cannot all be satisfied simultaneously. 
Feedback design is therefore a trade-off over frequency of conflicting 
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objectives. This is not always as difficult as it sounds because the frequency 
ranges over which the objectives are important can be quite different. For 
example, disturbance rejection is typically a low frequency requirement, while 
noise mitigation is often only relevant at higher frequencies. 

In classical loop shaping, it is the magnitude of the open-loop transfer 
function EL = GK which is shaped, whereas the above design requirements 
are all in terms of closed-loop transfer functions. However, recall from (3.48) 
that 

ipso Soa (9.3) 

a(S) 

from which we see that a(S) = 1/a(L) at frequencies where o(ZL) is much 
larger than 1. It also follows that at the bandwidth frequency (where 
1/a(S(jwe)) = V2 = 1.41) we have o(L(jwp)) between 0.41 and 2.41. 
Furthermore, from T = L(I + L)~! it follows that a(T) = @(L) at frequencies 
where a(L) is small. Thus, over specified frequency ranges, it is relatively 
easy to approximate the closed-loop requirements by the following open-loop 
objectives: 


1. For disturbance rejection make ¢(GK) large; valid for frequencies at which 
a(GK) > 1. 

2. For noise attenuation make (GK) small; valid for frequencies at which 
a(Gk) «1. 

3. For reference tracking make o(GK) large; valid for frequencies at which 
a(GK) > 1. 

4. For control energy reduction make o(K) small; valid for frequencies at 
which a(GK) <1. 

5. For robust stability to an additive perturbation make a(K’) small; valid for 
frequencies at. which (GK) <1. 

6. For robust stability to a multiplicative output perturbation make a(GK) 
small; valid for frequencies at which (GK) < 1. 


Typically, the open-loop requirements 1 and 3 are valid and important at low 
frequencies, 0 < w < uw, < wp, while 2, 4, 5 and 6 are conditions which are 
valid and important at high frequencies, wR < wp, < w < , as illustrated 
in Figure 9.2. From this we see that at frequencies where we want high gains 
(at low frequencies) the “worst-case” direction is related to o(GK), whereas 
as frequencies where we want low gains (at high frequencies) the “worst-case” 
direction is related to (GK). 


Exercise 9.1 Show that the closed-loop objectives 1 to 6 can be approximated by 
the open-loop objectives 1 to 6 at the specified frequency ranges. 


From Figure 9.2, it follows that the control engineer must design K so that 
(GK) and o(GK) avoid the shaded regions. That is, for good performance, 
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Figure 9.2: Design trade-offs for the multivariable loop transfer function GK 


a(GK) must be made to lie above a performance boundary for all w up to uy, 
and for robust stability ¢(GK) must be forced below a robustness boundary 
for all w above w;,. To shape the singular values of GK by selecting K is a 
relatively easy task, but to do this in a way which also guarantees closed-loop 
stability is in general difficult. Closed-loop stability cannot be determined 
from open-loop singular values. 

For SISO systems, it is clear from Bode’s work (1945) that closed-loop 
stability is closely related to open-loop gain and phase near the crossover 
frequency w,., where |G.K(jw,)| = 1. In particular, the roll-off rate from high 
to low gain at crossover is limited by phase requirements for stability, and in 
practice this corresponds to a roll-off rate less than 40dB/decade; see section 
2.6.2. An immediate consequence of this is that there is a lower limit to the 
difference between wp, and uy in Figure 9.2. 

For MIMO systems a similar gain-phase relationship holds in the crossover 
frequency region, but this is in terms of the eigenvalues of GK and results 
in a limit on the roll-off rate of the magnitude of the eigenvalues of GK, 
not the singular values (Doyle and Stein, 1981). The stability constraint is 
therefore even more difficult to handle in multivariable loop-shaping than 
it is in classical loop-shaping. To overcome this difficulty Doyle and Stein 
(1981)proposed that the loop-shaping should be done with a controller that 
was already known to guarantee stability. They suggested that an LQG 
controller could be used in which the regulator part is designed using a 
“sensitivity recovery” procedure of Kwakernaak (1969) to give desirable 
properties (gain and phase margins) in G'K. They also gave a dual “robustness 
recovery” procedure for designing the filter in an LQG controller to give 
desirable properties in KG. Recall that KG is not in general equal to GK, 
which implies that stability margins vary from one break point to another ina 
multivariable system. Both of these loop transfer recovery (LTR) procedures 
are discussed below after first describing traditional LQG control. 
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9.2 LQG control 


Optimal control, building on the optimal filtering work of Wiener in the 1940’s, 
reached maturity in the 1960’s with what we now call Linear Quadratic 
Gaussian or LQG Control. Its development coincided with large research 
programs and considerable funding in the United States and the former 
Soviet Union on space related problems. These were problems, such as rocket 
manoeuvering with minimum fuel consumption, which could be well defined 
and easily formulated as optimizations. Aerospace engineers were particularly 
successful at applying LQG, but when other control engineers attempted 
to use LQG on everyday industrial problems a different story emerged. 
Accurate plant models were frequently not available and the assumption of 
white noise disturbances was not always relevant or meaningful to practising 
control engineers. As a result LQG designs were sometimes not robust 
enough to be used in practice. In this section, we will describe the LQG 
problem and its solution, we will discuss its robustness properties, and we will 
describe procedures for improving robustness. Many text books consider this 
topic in far greater detail; we recommend Anderson and Moore (1989) and 
Kwakernaak and Sivan (1972). 


9.2.1 Traditional LQG and LQR problems 


In traditional LQG Control, it is assumed that the plant dynamics are linear 
and known, and that the measurement noise and disturbance signals (process 
noise) are stochastic with known statistical properties. That is, we have a 
plant model 


t = Ar+But+wg 
= Cr+wn (9.5) 


where wg and wy, are the disturbance (process noise) and measurement noise 
inputs respectively, which are usually assumed to be uncorrelated zero-mean 
Gaussian stochastic processes with constant power spectral density matrices 
W and V respectively. That is, wa and wy, are white noise processes with 
covariances 


E {wa(t)wa(t)’} = W6(t—7) (9.6) 
E {wn (t)Wn (r)7} = Vd(t—T) 
and 
E {wa(t)wn(7)"} =0, E {wn (t)wa(r)* } =0 (9.8) 


where FE is the expectation operator and 6(t — 7) is a delta function. 
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The LQG control problem is to find the optimal control u(t) which 
minimizes 


T 
J= | lim zi [2’ Qz +u" Rul al (9.9) 
T30 T 0 


where Q and R are appropriately chosen constant weighting matrices (design 
parameters) such that Q = QT > 0 and R = R™ > 0. The name LQG 
arises from the use of a Linear model, an integral Quadratic cost function, 
and Gaussian white noise processes to model disturbance signals and noise. 

The solution to the LQG problem, known as the Separation Theorem or 
Certainty Equivalence Principle, is surprisingly simple and elegant. It consists 
of first determining the optimal control to a deterministic linear quadratic 
regulator (LQR) problem: namely, the above LQG problem without wg and 
Wn. It happens that the solution to this problem can be written in terms of 
the simple state feedback law 


u(t) = —K,«(t) (9.10) 


where K; is a constant matrix which is easy to compute and is clearly 
independent of W and V, the statistical properties of the plant noise. The next 


step is to find an optimal estimate ¢ of the state x, so that E {(e — 4)" [a — al} 


is minimized. The optimal state estimate is given by a Kalman filter and is 
independent of Q and R. The required solution to the LQG problem is then 
found by replacing x by @, to give u(t) = —K,@(t). We therefore see that the 
LQG problem and its solution can be separated into two distinct parts, as 
illustrated in Figure 9.3. 

We will now give the equations necessary to find the optimal state-feedback 
matrix K, and the Kalman Filter. 

Optimal state feedback. The LQR problem, where all the states are 
known, is the deterministic initial value problem: Given the system # = 
Az + Bu with a given non-zero initial state x(0), find the input signal u(t) 
which the system back to the zero state (x = 0) in an optimal manner, i... 
by minimizing the deterministic cost 


Jp = (a(t)? Qa(t) + u(t)? Ru(t))dt (9.11) 
0 
The optimal solution is for any initial state u(t) = —K,x(t) where 
Kea RUBE X (9.12) 


and X = XT > 0 is the unique positive-semidefinite solution of the algebraic 
Riccati equation 


ATX 4+XA-—XBR'‘BTX+Q=0 (9.13) 
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Deterministic . 
linear quadratic Kalman Filter 


regulator : 
constant, —K, dynamic system 


Figure 9.3: The Separation Theorem 
Kalman filter. The Kalman filter has the structure of an ordinary state- 
estimator or observer, as shown in Figure 9.4, with 
@ = AF + Bu+ K;(y — C2) (9.14) 


The optimal choice of Ky, which minimizes E {(e — 2)" [x — 4] i, is given by 


K;=YC'TV" (9.15) 


where Y = YT > O is the unique positive-semidefinite solution of the algebraic 
Riccati equation 


YA? + AY —YCTV-1CY + W=0 (9.16) 


LQG: Combined optimal state estimation and optimal state 
feedback. The LQG control problem is to minimize J in (9.9). The structure 
of the LQG controller is illustrated in Figure 9.4; its transfer function, from 
y to u, is easily shown to be given by 


s | 4—BK,—K;C|K 
Kraa(s) = | —K, ; | 0 
ABR BTx—Yotv-ic¢ | yotv— 
~ —RTBTX | O Oty) 


It has the same degree (number of poles) as the plant. 
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Kalman 
Filter 


LQ Regulator 


Figure 9.4: The LQG controller and noisy plant 


Remark. The optimal gain matrices Ky and K, exist, and the LQG-controlled 
system is internally stable, provided the systems with state-space realizations 
(A, B,Q?) and (A, w?, C) are stabilizable and detectable. 


Exercise 9.2 For the plant and LQG controller arrangement of Figure 9.4, show 
that the closed-loop dynamics are described by 


ad x = A-—BK, BK, x ec 

dt | «—@ a 0 A—K;C L—-f Wn 
This shows that the closed-loop poles are simply the union of the poles of the 
deterministic LQR system (eigenvalues of A— BK,) and the poles of the Kalman 


filter (eigenvalues of A— KysC). It is exactly as we would have expected from the 
Separation Theorem. 


Exercise 9.3 In Figure 9.5, a reference input r is introduced into the LQG 
controller-plant configuration and the controller is rearranged to illustrate its two 
degrees-of-freedom structure. Show that the transfer function of the controller K(s) 


linking [ pty? i to u is 
K(s) = [ Ki(s) Ka(s) ] 
= —K,(sI-A+BK,+K;C)"[B K;]+[JI 0] 
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Figure 9.5: A rearranged LQG controller with reference input 


If the LQG controller is stable, for which there is no guarantee, then K1(s) can be 
implemented as a pre-filter with K2(s) the feedback controller. However, since the 
degrees (number of states) of K1(s), K2(s) and Krea(s) are in general the same, 
there is an increase in complexity on implementing K1(s) and K2(s) separately and 
so this is not recommended. 


Example 9.1 LQG design for inverse response process. We will here apply 
LQG to the SISO inverse response process introduced in Chapter 2. Recall from 
(2.27) that the plant model has a RHP-zero and is given by G(s) = CEE STTTRESUE 
The standard LQG design procedure does not give a controller with integral action, 
so we augment the plant G(s) with an integrator before starting the design, and 
then use this integrator as part of the final controller. For the objective function 
J= [(a7Qa +u™ Ru)dt we use the weights Q = qC™C, with q=1, and R= 1. 
(Only the ratio between q and R matters and reducing R yields a faster response.) 
Our choice for Q with the term C7 C implies that we are weighting the output y rather 
than the states. Because we have augmented the plant with an integrator the weight R 
penalizes the derivative of the input, du/dt. For the noise weights we select W = wl, 
where w = 1, (process noise directly on the states) and V = 1 (measurement noise). 
(Only the ratio between w and V matter and reducing V yields a faster response.) 
The MATLAB file in Table 9.1 was used to design the LQG controller. 

The resulting closed-loop response is shown in Figure 9.6. The response is not 
very good, nor is it easy to improve by adjusting the weights. One reason for this 
is that in our formulation we have penalized du/dt which makes it difficult for the 
controller to respond quickly to changes. For a comparison, see the loop-shaping 
design in Figure 2.17. 
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25 30 35 40 45 50 


Time [sec] 


Figure 9.6: LQG design for inverse response process. Closed-loop response to unit 


step in reference. 


Table 9.1: MATLAB commands 


ExamplE 9.1. 


to generate LQG controller in 


¥, Uses the Robust control toolbox 
G=nd2sys([-2,1],[50 15 1],3); 

int = nd2sys(1,[1 0.0001]); 

y 

vA 

Gs = mmult(G,int); 

[A,B,C,D] = unpck(Gs) ; 


Q = 1.0*C’*C; 
R= 13 
h 


Kx = lgr(A,B,Q,R); 

Bnoise = eye(size(A)); 

W = eye(size(A)); 

vei; 

h 

Ke = lge(A,Bnoise,C,W,V); 
[Ac,Bc,Cc,Dc] = reg(A,B,C,D,Kx,Ke); 
Ks = pck(Ac,Bc,Cc,Dc); 

Klqg = mmult(Ks,int); 

h 


original plant. 

integrator (moved slightly into LHP for 
numerical reasons). 

augmented plant (with integrator). 
Augmented Plant is [A,B,C,D]. 

weight on outputs. 

weight on inputs, R small gives faster 
response. 

optimal regulator. 

process noise directly on states. 
process noise. 

measurement noise, V small gives faster 
response. 

optimal estimator. 

combine regulator and estimator. 


include integrator in final 
controller. 
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9.2.2 Robustness properties 


For an LQG-controlled system with a combined Kalman filter and LQR 
control law there are no guaranteed stability margins. This was brought 
starkly to the attention of the control community by Doyle (1978). He showed, 
by example, that there exist LQG combinations with arbitrarily small gain 
margins. 

However, for an LQR-controlled system (i.e. assuming all the states are 
available and no stochastic inputs) it is well known (Kalman, 1964; Safonov 
and Athans, 1977) that, if the weight R is chosen to be diagonal, the sensitivity 
function S = (I+ K,(sI — A)‘ B)~' satisfies 


&(S(jw)) <1, Vw (9.18) 


From this it can be shown that the system will have a gain margin equal 
to infinity, a gain reduction margin equal to 0.5, and a (minimum) phase 
margin of 60° in each plant input control channel. This means that in the 
LQR-controlled system u = —K,2, a complex perturbation diag {kyel® } can 
be introduced at the plant inputs without causing instability providing 


(i) 6; =0 and 0.5 < kj < co,4=1,2,...,m 
or 
(ii) kj = 1 and |;| < 60°, i=1,2,...,m 


where m is the number of plant inputs. For a single-input plant, the above 
shows that the Nyquist diagram of the open-loop regulator transfer function 
K,(sI — A)~'B will always lie outside the unit circle with centre —1. This 
was first shown by Kalman (1964), and is illustrated in Example 9.2 below. 


Example 9.2 LQR design of a first order process. Consider a first order 
process G(s) =1/(s — a) with the state-space realization 


a(t) =aa(t)+u(t), y(t) =e?) 


s0 that the state is directly measured. For a non-zero initial state the cost function 
to be minimized is 


Ir = i (a” + Ru” )dt 
0 
The algebraic Riccati equation (9.13) becomes (A =a, B=1, Q =1) 


aX+Xa—-XR'X4+1=0 & X*®-2aRX—R=0 


which since X >0 gives X =aR+ /(aR)?2+R. The optimal control is given by 
u=—K,« where from (9.12) 


K, =X/R=a+/a?4+1/R 
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and we get the closed-loop system 


t=ar+u=—-Vae+1/Rea 


The closed-loop pole is located at s = —,/a?+1/R <0. Thus, the root locus for the 
optimal closed-loop pole with respect to R staris at s = —|a| for R = c (infinite 
weight on the input) and moves to —oo along the real atis as R approaches zero. 
Note that the root locus is identical for stable (a < 0) and unstable (a > 0) plants 
G(s) with the same value of |a|. In particular, fora > 0 we see that the minimum 
input energy needed to stabilize the plant (corresponding to R = 00) is obtained with 


the input wu = —2|a\%, which moves the pole from s = a to its mirror image at 
$s=—a. 

For R small (“cheap control”) the gain crossover frequency of the loop transfer 
function L= GK, = K,/(s — a) is given approzimately by w. © \/1/R. Note also 
that L(jw) has a roll-off of -1 at high frequencies, which is a general property of 
LQR designs. Furthermore, the Nyquist plot of L(jw) avoids the unit disc centred 
on the critical point —1, that is |S(jw)| = 1/|1+ L(jw)| < 1 at all frequencies. This 
is obvious for the stable plant with a <0 since K, >0 and then the phase of L(jw) 
varies from 0° (at zero frequency) to —90° (at infinite frequency). The surprise is 
that it is also true for the unstable plant even though the phase of L(jw) varies from 
—180° to —90°. 


Consider now the Kalman filter shown earlier in Figure 9.4. Notice that it 
is itself a feedback system. Arguments dual to those employed for the LQR- 
controlled system can then be used to show that, if the power spectral density 
matrix V is chosen to be diagonal, then at the input to the Kalman gain matrix 
Ky there will be an infinite gain margin, a gain reduction margin of 0.5 and 
a minimum phase margin of 60°. Consequently, for a single-output plant, the 
Nyquist diagram of the open-loop filter transfer function C(sI —.A)~'K; will 
lie outside the unit circle with centre at —1. 

An LQR-controlled system has good stability margins at the plant inputs, 
and a Kalman filter has good stability margins at the inputs to Ky, so 
why are there no guarantees for LQG control? To answer this, consider the 
LQG controller arrangement shown in Figure 9.7. The loop transfer functions 
associated with the labeled points 1 to 4 are respectively 


Ii(s) = —K, [®(s)"!+ BK, + KyC] | KyC4(s)B 
= Kziaa(s)G(s) (9.19 
[2(s) = G(s)Kraa(s) (9.20 
[3(s) = K,®(s)B (regulator transfer function) (9.21 
L[4(s) = C&(s) Ky (filter transfer function) (9.22 
where 
B(s) 3 (sf — A) (9.23 
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Plant, G(s) 


Controller, Kr.ga(s) 


Figure 9.7: LQG-controlled plant 


Kiga(s) is as in (9.17) and G(s) = C®(s)B is the plant model. 

At points 3 and 4 we have the guaranteed robustness properties of the LQR 
system and the Kalman filter respectively. But at the actual input and output 
of the plant (points 1 and 2) where we are most interested in achieving good 
stability margins, we have complex transfer functions which in general give 
no guarantees of satisfactory robustness properties. Notice also that points 3 
and 4 are effectively inside the LQG controller which has to be implemented, 
most likely as software, and so we have good stability margins where they are 
not really needed and no guarantees where they are. 

Fortunately, for a minimum-phase plant procedures developed by 
Kwakernaak (1969) and Doyle and Stein (1979; 1981) show how, by a suitable 
choice of parameters, either L,(s) can be made to tend asymptotically to L3(s) 
or Ly(s) can be made to approach L4(s). These procedures are considered 
next. 
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9.2.3. Loop transfer recovery (LTR) procedures 


For full details of the recovery procedures, we refer the reader to the 
original communications (Kwakernaak, 1969; Doyle and Stein, 1979; Doyle 
and Stein, 1981) or to the tutorial paper by Stein and Athans (1987). We 
will only give an outline of the major steps here, since we will argue later 
that the procedures are somewhat limited for practical control system design. 
For a more recent appraisal of LTR, we recommend a Special Issue of the 
International Journal of Robust and Nonlinear Control, edited by Niemann 
and Stoustrup (1995). 

The LQG loop transfer function L2(s) can be made to approach C'®(s)K z, 
with its guaranteed stability margins, if K, in the LQR problem is designed 
to be large using the sensitivity recovery procedure of Kwakernaak (1969). It 
is necessary to assume that the plant model G(s) is minimum phase and that 
it has at least as many inputs as outputs. 

Alternatively, the LQG loop transfer function L,(s) can be made to 
approach K,®(s)B by designing A’y in the Kalman filter to be large using 
the robustness recovery procedure of Doyle and Stein (1979). Again, it is 
necessary to assume that the plant model G(s) is minimum phase, but this 
time it must have at least as many outputs as inputs. 

The procedures are dual and therefore we will only consider recovering 
robustness at the plant output. That is, we aim to make L2(s) = G(s) Kiaa(s) 
approximately equal to the Kalman filter transfer function C®(s)K. 

First, we design a Kalman filter whose transfer function C®(s)Ky is 
desirable. This is done, in an iterative fashion, by choosing the power spectral 
density matrices W and V so that the minimum singular value of C®(s) Ky 
is large enough at low frequencies for good performance and its maximum 
singular value is small enough at high frequencies for robust stability, as 
discussed in section 9.1. Notice that W and V are being used here as design 
parameters and their associated stochastic processes are considered to be 
fictitious. In tuning W and V we should be careful to choose V as diagonal and 
W =(BS)(BS)", where S is a scaling matrix which can be used to balance, 
raise, or lower the singular values. When the singular values of C'®(s)Ky are 
thought to be satisfactory, loop transfer recovery is achieved by designing K,. 
in an LQR problem with Q = CTC and R = pl, where p is a scalar. As p 
tends to zero G(s) Kiqq tends to the desired loop transfer function C'®(s) Ky. 

Much has been written on the use of LTR procedures in multivariable 
control system design. But as methods for multivariable loop-shaping they 
are limited in their applicability and sometimes difficult to use. Their main 
limitation is to minimum phase plants. This is because the recovery procedures 
work by cancelling the plant zeros, and a cancelled non-minimum phase zero 
would lead to instability. The cancellation of lightly damped zeros is also of 
concern because of undesirable oscillations at these modes during transients. 
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A further disadvantage is that the limiting process (9 — 0) which brings 
about full recovery also introduces high gains which may cause problems 
with unmodelled dynamics. Because of the above disadvantages, the recovery 
procedures are not usually taken to their limits (9 — 0) to achieve full 
recovery, but rather a class of designs obtained (for small p) and an acceptable 
one selected. The result is a somewhat ad-hoc design procedure in which the 
singular values of a loop transfer function, G(s)Ktaa(s) or Kiaa(s)G(s), 
are indirectly shaped. A more direct and intuitively appealing method for 
multivariable loop-shaping will be given in section 9.4. 


9.3 Hy and H, control 


Motivated by the shortcomings of LQG control there was in the 1980’s 
a significant shift towards H. optimization for robust control. This 
development originated from the influential work of Zames (1981), although 
an earlier use of H.. optimization in an engineering context can be found 
in Helton (1976). Zames argued that the poor robustness properties of LQG 
could be attributed to the integral criterion in terms of the Hz norm, and 
he also criticized the representation of uncertain disturbances by white noise 
processes as often unrealistic. As the 7. theory developed, however, the two 
approaches of Hy and H., control were seen to be more closely related than 
originally thought, particularly in the solution process; see for example Glover 
and Doyle (1988) and Doyle, Glover, Khargonekar and Francis (1989). In this 
section, we will begin with a general control problem formulation into which 
we can cast all Hz and H. optimizations of practical interest. The general 
Hy and H» problems will be described along with some specific and typical 
control problems. It is not our intention to describe in detail the mathematical 
solutions, since efficient, commercial software for solving such problems is now 
so easily available. Rather we seek to provide an understanding of some useful 
problem formulations which might then be used by the reader, or modified to 
suit his or her application. 


9.3.1 General control problem formulation 


There are many ways in which feedback design problems can be cast as H2 
and Ho optimization problems. It is very useful therefore to have a standard 
problem formulation into which any particular problem may be manipulated. 
Such a general formulation is afforded by the general configuration shown in 
Figure 9.8 and discussed earlier in Chapter 3. The system of Figure 9.8 is 
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Figure 9.8: General contro] configuration 


described by 


=| Ba Bale) 1] 
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The signals are: wu the control variables, v the measured variables, w the 
exogenous signals such as disturbances wa and commands r, and z the so- 
called “error” signals which are to be minimized in some sense to meet the 
control objectives. As shown in (3.98) the closed-loop transfer function from 
w to z is given by the linear fractional transformation 


z= (P,K)w (9.27) 


where 
Fi(P, K) = Py + Pio K (I — Po K)7' Po (9.28) 


Hy and H., control involve the minimization of the H. and H. norms of 
F\(P, K) respectively. We will consider each of them in turn. 

First some remarks about the algorithms used to solve such problems. 
The most general, widely available and widely used algorithms for Hz and 
H.. control problems are based on the state-space solutions in (Glover 
and Doyle, 1988) and (Doyle et al., 1989). It is worth mentioning again 
that the similarities between Hz and H. theory are most clearly evident 
in the aforementioned algorithms. For example, both Hz and H» require 
the solutions to two Riccati equations, they both give controllers of state- 
dimension equal to that of the generalized plant P, and they both exhibit 
a separation structure in the controller already seen in LQG control. An 
algorithm for H., control problems is summarized in Section 9.3.4. 

The following assumptions are typically made in Hg and H» problems: 
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(Al) (A, Bo, C2) is stabilizable and detectable. 
(A2) Dy and Da, have full rank. 


(A3 ee has full column rank for all w. 
C1 Di 

(A4 Benet Ba has full row rank for all w. 
C2 D2, 


(A5 Diy = 0 and Doe =0: 


Assumption (A1) is required for the existence of stabilizing controllers K’, and 
assumption (A2) is sufficient to ensure the controllers are proper and hence 
realizable. Assumptions (A3) and (A4) ensure that the optimal controller does 
not try to cancel poles or zeros on the imaginary axis which would result 
in closed-loop instability. Assumption (A5) is conventional in Hy» control. 
D,, =0 makes P,, strictly proper. Recall that Hz is the set of strictly proper 
stable transfer functions. D2. = 0 makes Ps. strictly proper and simplifies 
the formulae in the Hz algorithms. In Ho, neither Dj, = 0, nor Doz = 0, is 
required but they do significantly simplify the algorithm formulae. If they are 
not zero, an equivalent #/., problem can be constructed in which they are; 
see (Safonov, Limebeer and Chiang, 1989) and (Green and Limebeer, 1995). 
For simplicity, it is also sometimes assumed that D1. and D2, are given by 


(M6) Dig= ° 


7 | and Dar = [ 0 |i 


This can be achieved, without loss of generality, by a scaling of u and v and 
a unitary transformation of w and z; see for example Maciejowski (1989) .In 
addition, for simplicity of exposition, the following additional assumptions are 
sometimes made 


(A8) (A,B) is stabilizable and (A,C;) is detectable. 


Assumption (A7) is common in Hz control e.g. in LQG where there are no 
cross terms in the cost function (D7,C; = 0), and the process noise and 
measurement noise are uncorrelated (B, DJ, = 0). Notice that if (A7) holds 
then (A3) and (A4) may be replaced by (A8). 

Whilst the above assumptions may appear daunting, most sensibly posed 
control problems will meet them. Therefore, if the software (e.g. p-tools or 
the robust control toolbox of MATLAB) complains, then it probably means 
that your control problem is not well formulated and you should think again. 

Lastly, it should be said that H., algorithms, in general, find a sub-optimal 
controller. That is, for a specified + a stabilizing controller is found for which 
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|Fi(P, K)|lo < y. If an optimal controller is required then the algorithm 
can be used iteratively, reducing y until the minimum is reached within a 
given tolerance. In general, to find an optimal #1. controller is numerically 
and theoretically complicated. This contrasts significantly with H theory, in 
which the optimal controller is unique and can be found from the solution of 
just two Riccati equations. 


9.3.2 Hy optimal control 


The standard Hz optimal control problem is to find a stabilizing controller K 
which minimizes 


|| F(s)l2 = / = A ” Fw) F(ju)"do; F = F(P,K) (9.29) 


For a particular problem the generalized plant P will include the plant model, 
the interconnection structure, and the designer specified weighting functions. 
This is illustrated for the LQG problem in the next subsection. 

As discussed in Section 4.10.1 and noted in Tables A.1 and A.2 on page 560, 
the Hz norm can be given different deterministic interpretations. It also 
has the following stochastic interpretation: Suppose in the general control 
configuration that the exogenous input w is white noise of unit intensity. 
That is: 


E {w(t)w(r)" } = 16(t — 7) (9.30) 


The expected power in the error signal z is then given by 


T 
ef Bn oe [sere (9.31) 
tr E {z(t)2(t)"} 


=e REE Gar a 
2a J 35 
(by Parseval’s Theorem) 
= |lFb = ||A(P, K)|5 (9.32) 


Thus by minimizing the Hz norm, the output (or error) power of the 
generalized system, due to a unit intensity white noise input, is minimized; 
we are minimizing the root-mean-square (rms) value of z. 
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9.3.3 LQG: a special H2 optimal controller 


An important special case of Hy optimal control is the LQG problem described 
in subsection 9.2.1. For the stochastic system 


g = Axr+But+wa (9.33) 
= Crt+wpy (9.34) 
where 
e{| ee [ walt)" wale)” | \. ve | se—7) (9.35) 
The LQG problem is to find u = K(s)y such that 
1/7 
Jae jim F | [77 Qr +ul Ru] a} (9.36) 


is minimized with Q = QT > 0 and R= R™ > 0. 
This problem can be cast as an Hy optimization in the general framework 
in the following manner. Define an error signal z as 


_[ Q? 0 £ 
and represent the stochastic inputs wg, Wp as 
1 
Wad _ W: 0 
fe ]-[¥ 9] ox 
where w is a white noise process of unit intensity. Then the LQG cost function 
is 
ee ae 2 
J=E<¢ lim = z(t)’ z(t)dt > = ||Fi(P, K)||5 (9.39) 
To00 T 0 
where 
2(s) = EF, (P, K)w(s) (9.40) 
and the generalized plant P is given by 
A|W? 0! 8B 
T j 
_| Pui Po |] s Q? 0 0 ; 0 
Pelee re 0) 0 oR (9.41) 
GO Yao 


The above formulation of the LQG problem is illustrated in the general 
setting in Figure 9.9. With the standard assumptions for the LQG problem, 
application of the general H2 formulae Doyle et al. (1989) to this formulation 
gives the familiar LQG optimal controller as in (9.17). 
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Figure 9.9: The LQG problem formulated in the general control configuration 


9.3.4 H optimal control 


With reference to the general control configuration of Figure 9.8, The standard 
Hex optimal control problem is to find all stabilizing controllers K which 
minimize 


|Fi(P, K) loo = max a(Fi(P, K)(jw)) (9.42) 


As discussed in Section 4.10.2 the H,.norm has several interpretations in 
terms of performance, One is that it minimizes the peak of the singular value 
of Fi(P(jw), K (jw). It also has a time domain interpretation as the induced 
(worst-case) two-norm: Let z = F,(P, K)w. Then 


Ilz(t) lo 
F,(P,K)|\o = max 9.43 
Fit ul w(t)40 ||w(t)|l2 0) 


where ||z(t)||2 = i >; |zs(t) |? dt is the 2-norm of the vector signal. 

In practice, it is usually not necessary to obtain an optimal controller for 
the H,.problem, and it is often computationally (and theoretically) simpler 
to design a sub-optimal one (i.e. one close to the optimal ones in the sense 
of the H,.norm). Let ymin be the minimum value of ||f)(P, K)||.. over all 
stabilizing controllers kK. Then the H.. sub-optimal control problem is: given 
a > Ymin, find all stabilizing controllers K such that 


|Fi(P, K)Iloo <7 


This can be solved efficiently using the algorithm of Doyle et al. (1989), and 
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by reducing y iteratively, an optimal solution is approached. The algorithm is 
summarized below with all the simplifying assumptions. 

General 7H, algorithm. For the general control configuration of 
Figure 9.8 described by equations (9.24)-(9.26), with assumptions (A1) to 
(A8) in Section 9.3.1, there exists a stabilizing controller K(s) such that 
||F(P, K)|loo <7 if and only if 


(i) Xoo > 0 is a solution to the algebraic Riccati equation 


ATX 4+ Xo At CPC) + Xoo (y 7? Bi BT — BoBE)X..=0 (9.44) 
such that Re A; [A +(y-?B BT - By BY) Xx] <0, Vi; and 
(ii) Y. > 0 is a solution to the algebraic Riccati equation 


AV xo + Yoo AT + Bi BT + Vou (y2CF C, — CPC2)V¥on =0 (9.45) 


such that Re A; [A + Yo(y 207 Ci, — CF C2)] < 0, Vi; and 
(iii) p(Xo0Yoo) <7” 
All such controllers are then given by K = F,(K,.,Q) where 


Aig, Beolisg “Boe Bs 
K.(s) = | Feo 0 T (9.46) 
=) I 0 


Pe BERNA S  E Ve Cl LS SER AVS XS) (9.47) 

Ago = A+ 77? BBE X55 + BoFoo + ZooDoo C2 (9.48) 

and Q(s) is any stable proper transfer function such that ||Qllo < y. For 
Q(s) =0, we get 

K(s) = Key (8) = —ZooLoo(sI — Aw) 7! Foo (9.49) 

This is called the “central” controller and has the same number of states as 


the generalized plant P(s). The central controller can be separated into a state 
estimator (observer) of the form 


@ = AF + Bry? BI X.F+Bout ZooLoo(Co# — y) (9.50) 
—— _ 


A 
Wworst 


and a state feedback 
U = Pook (9.51) 


Comparing the observer in (9.50) with the Kalman filter in (9.14) we see that 
it contains an additional term By Wworst, where Wworst Can be interpreted as 
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an estimate of the worst-case disturbance (exogenous input). Note that for 
the special case of #..loop shaping this extra term is not present. This is 
discussed in Section 9.4.4. 

y-iteration. If we desire a controller that achieves yin, to within a 
specified tolerance, then we can perform a bisection on y until its value is 
sufficiently accurate. The above result provides a test for each value of y to 
determine whether it is less than ymin or greater than ymin. 

Given all the assumptions (A1) to (A8) the above is the most simple form of 
the general H... algorithm. For the more general situation, where some of the 
assumptions are relaxed, the reader is referred to the original sources (Glover 
and Doyle, 1988) and (Doyle et al., 1989). In practice, we would expect a user 
to have access to commercial software such as MATLAB and its toolboxes. 

In Section 2.7, we distinguished between two methodologies for Hoo 
controller design: the transfer function shaping approach and the signal-based 
approach. In the former, #1. optimization is used to shape the singular 
values of specified transfer functions over frequency. The maximum singular 
values are relatively easy to shape by forcing them to lie below user defined 
bounds, thereby ensuring desirable bandwidths and roll-off rates. In the signal- 
based approach, we seek to minimize the energy in certain error signals given 
a set of exogenous input signals. The latter might include the outputs of 
perturbations representing uncertainty, as well as the usual disturbances, 
noise, and command signals. Both of these two approaches will be considered 
again in the remainder of this section. In each case we will examine a particular 
problem and formulate it in the general control configuration. 

A difficulty that sometimes arises with H. control is the selection of 
weights such that the H,, optimal controller provides a good trade-off between 
conflicting objectives in various frequency ranges. Thus, for practical designs 
it is sometimes recommended to perform only a few iterations of the Ho 
algorithm. The justification for this is that the initial design, after one 
iteration, is similar to an H2 design which does trade-off over various frequency 
ranges. Therefore stopping the iterations before the optimal value is achieved 
gives the design an He flavour which may be desirable. 


9.3.5 Mixed-sensitivity H,, control 


Mixed-sensitivity is the name given to transfer function shaping problems in 
which the sensitivity function S$ = (I +GK)~! is shaped along with one or 
more other closed-loop transfer functions such as KS or the complementary 
sensitivity function T = J — S. Earlier in this chapter, by examining a typical 
one degree-of-freedom configuration, Figure 9.1, we saw quite clearly the 
importance of S, KS, and T. 

Suppose, therefore, that we have a regulation problem in which we want 
to reject a disturbance d entering at the plant output and it is assumed 
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that the measurement noise is relatively insignificant. Tracking is not an 
issue and therefore for this problem it makes sense to shape the closed-loop 
transfer functions S and KS in a one degree-of-freedom setting. Recall that 
S is the transfer function between d and the output, and A'S the transfer 
function between d and the control signals. It is important to include A'S as 
a mechanism for limiting the size and bandwidth of the controller, and hence 
the control energy used. The size of KS is also important for robust stability 
with respect to uncertainty modelled as additive plant perturbations. 

The disturbance d is typically a low frequency signal, and therefore it will 
be successfully rejected if the maximum singular value of S is made small 
over the same low frequencies. To do this we could select a scalar low pass 
filter w1(s) with a bandwidth equal to that of the disturbance, and then find 
a stabilizing controller that minimizes ||w1S||... This cost function alone is 
not very practical. It focuses on just one closed-loop transfer function and for 
plants without right-half plane zeros the optimal controller has infinite gains. 
In the presence of a nonminimum phase zero, the stability requirement will 
limit the controller gains indirectly, but it is far more useful in practice to 
minimize 
w 8S 
| ek |, 052 


where we(s) is a scalar high pass filter with a crossover frequency 
approximately equal to that of the desired closed-loop bandwidth. 

In general, the scalar weighting functions w1(s) and w2(s) can be replaced 
by matrices W(s) and W2(s). This can be useful for systems with channels 
of quite different bandwidths when diagonal weights are recommended, but 
anything more complicated is usually not worth the effort. 


Remark. Note we have here outline a slightly different way of selecting the weights 
from that in Example 2.11 and Section 3.4.6. There Wi = Wp was selected with a 
crossover frequency equal to that of the desired closed-loop bandwidth and W2 = W, 
was selected as a constant, usually W, = I. 


To see how this mixed-sensitivity problem can be formulated in the general 


setting, we can imagine the disturbance d as a single exogenous input and 
define an error signal z = [ op ok lee where z1 = Wiy and zg = —Wou, 
as illustrated in Figure 9.10. It is not difficult from Figure 9.10 to show that 
21 = W, Sw and zo = W.K Sw as required, and to determine the elements of 


the generalized plant P as 
WG 


W, 
Pu=| ia Pie _W 
Py =—I Px =—-G 


(9.53) 


394 MULTIVARIABLE FEEDBACK CONTROL 


Figure 9.10: S/KS mixed-sensitivity optimization in standard form (regulation) 


where the partitioning is such that 


can 
_ Py Pi Ww 
2 ]=[% Bz] 38 
v 
and e 
W; 
F(P,K) = Wok (9.55) 


Another interpretation can be put on the S/KS mixed-sensitivity 
optimization as shown in the standard control configuration of Figure 9.11. 
Here we consider a tracking problem. The exogenous input is a reference 
command r, and the error signals are z1 = Wye and z. = Wou. As in 
the regulation problem of Figure 9.10, we have in this tracking problem 
z, = W,Sw and z2 = W2kSw. An example of the use of S/KS mixed 
sensitivity minimization is given in Chapter 12, where it is used to design 
a rotorcraft control law. In this helicopter problem, you will see that the 
exogenous input w is passed through a weight W3 before it impinges on the 
system. W3 is chosen to weight the input signal and not directly to shape S 
or KS. This signal-based approach to weight selection is the topic of the next 
sub-section. 

Another useful mixed sensitivity optimization problem, again in a one 
degree-of-freedom setting, is to find a stabilizing controller which minimizes 


I[ wr JI, 950 
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Figure 9.11: S/KS mixed-sensitivity minimization in standard form (tracking) 


The ability to be able to shape T is desirable for tracking problems and 
noise attenuation. It is also important for robust stability with respect to 
multiplicative perturbations at the plant output. The S/T mixed-sensitivity 
minimization problem can be put into the standard control configuration as 
shown in Figure 9.12. The elements of the corresponding generalized plant P 


are a 
_| Wy _ | -W 
Pa=| 0 Po=| WoG (9.57) 
Py =I Po. =—-G 
Exercise 9.4 For the cost function 
WiS 
WeT (9.58) 
WskS ||| 


formulate a standard problem, draw the corresponding control configuration and give 
expressions for the generalized plant P. 


The shaping of closed-loop transfer functions as described above with the 
“stacked” cost functions becomes difficult with more than two functions. With 
two, the process is relatively easy. The bandwidth requirements on each are 
usually complementary and simple, stable, low-pass and high-pass filters are 
sufficient to carry out the required shaping and trade-offs. We stress that the 
weights W; in mixed-sensitivity #1.. optimal control must all be stable. If they 
are not, assumption (A1) in Section 9.3.1 is not satisfied, and the general H. 
algorithm is not applicable. Therefore if we wish, for example, to emphasize 
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Figure 9.12: S/T mixed-sensitivity optimization in standard form 


the minimization of S at low frequencies by weighting with a term including 
1 


integral action, we would have to approximate 4 by me where € << 1. This is 
exactly what was done in Example 2.11. Similarly one might be interested in 
weighting KS with a non-proper weight to ensure that K is small outside the 
system bandwidth. But the standard assumptions preclude such a weight. The 
trick here is to replace a non-proper term such as (1+7)5) by (1+7s)/(1+728) 
where T2 << 7,. A useful discussion of the tricks involved in using “unstable” 
and “non-proper” weights in 71. control can be found in Meinsma (1995). 
For more complex problems, information might be given about several 
exogenous signals in addition to a variety of signals to be minimized and 
classes of plant perturbations to be robust against. In which case, the mixed- 
sensitivity approach is not general enough and we are forced to look at more 


advanced techniques such as the signal-based approach considered next. 


9.3.6 Signal-based H,, control 


The signal-based approach to controller design is very general and is 
appropriate for multivariable problems in which several objectives must be 
taken into account simultaneously. In this approach, we define the plant and 
possibly the model uncertainty, we define the class of external signals affecting 
the system and we define the norm of the error signals we want to keep small. 
The focus of attention has moved to the size of signals and away from the size 
and bandwidth of selected closed-loop transfer functions. 

Weights are used to describe the expected or known frequency content of 


CONTROL SYSTEM DESIGN 397 


exogenous signals and the desired frequency content of error signals. Weights 
are also used if a perturbation is used to model uncertainty, as in Figure 9.13, 
where G is the nominal model, W is a weighting function that captures the 
relative model fidelity over frequency, and A represents unmodelled dynamics 
usually normalized via W so that ||Allo. < 1; see Chapter 8 for more details. 
. As in mixed-sensitivity H.. control, the weights in signal-based H.. control 
need to be stable and proper for the general H1.. algorithm to be applicable. 


Figure 9.13: Multiplicative dynamic uncertainty model 


LQG control is a simple example of the signal-based approach, in which 
the exogenous signals are assumed to be stochastic (or alternatively impulses 
in a deterministic setting) and the error signals are measured in terms of 
the 2-norm. As we have already seen, the weights Q and R are constant, but 
LQG can be generalized to include frequency dependent weights on the signals 
leading to what is sometimes called Wiener-Hopf design, or simply H»2 control. 

When we consider a system’s response to persistent sinusoidal signals of 
varying frequency, or when we consider the induced 2-norm between the 
exogenous input signals and the error signals, we are required to minimize the 
H.. norm. In the absence of model uncertainty, there does not appear to be 
an overwhelming case for using the H.. norm rather than the more traditional 
Hy norm. However, when uncertainty is addressed, as it always should be, H 
is clearly the more natural approach using component uncertainty models as 
in Figure 9.13. 

A typical problem using the signal-based approach to H. control is 
illustrated in the interconnection diagram of Figure 9.14. G and Gq are 
nominal models of the plant and disturbance dynamics, and K is the controller 
to be designed. The weights Wy, W; and W,, may be constant or dynamic and 
describe the relative importance and/or frequency content of the disturbances, 
set points, and noise signals. The weight Wyrer is a desired closed-loop transfer 
function between the weighted set point r, and the actual output y. The 
weights W, and W,, reflect the desired frequency content of the error (y — ya) 
and the control signals u, respectively. The problem can be cast as a standard 
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Figure 9.14: A signal-based H.. control problem 


Hoo optimization in the general control configuration by defining 


d 


ZA 
w= T Z= 
7) 


n 


Li 
eS U=U 
Ym 


in the general setting of Figure 9.8. 

Suppose we now introduce a multiplicative dynamic uncertainty model at 
the input to the plant as shown in Figure 9.15. The problem we now want to 
solve is: find a stabilizing controller K such that the #.. norm of the transfer 
function between w and z is less than 1 for all A, where ||Al|,.. < 1. We have 
assumed in this statement that the signal weights have normalized the 2-norm 
of the exogenous input signals to unity. This problem is a non-standard Ho 
optimization. It is a Robust Performance problem for which the pu-synthesis 
procedure, outlined in Chapter 8, can be applied. Mathematically, we require 
the structured singular value 


(9.59) 


(MM (gw)) < 1, Vw (9.60) 
where M is the transfer function matrix between 

d 

r ms 

. and 29 (9.61) 

5 € 


and the associated block diagonal perturbation has 2 blocks: a fictitious 
Pont ie and [eq ze ie and an 


performance block between [ d™ rX n 
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Figure 9.15: An H.. robust performance problem 


uncertainty block A between 6 and «. Whilst the structured singular value is 
a useful analysis tool for assessing designs, p-synthesis is sometimes difficult 
to use and often too complex for the practical problem at hand. In its full 
generality, the p-synthesis problem is not yet solved mathematically; where 
solutions exist the controllers tend to be of very high order; the algorithms may 
not always converge and design problems are sometimes difficult to formulate 
directly. 

For many industrial control problems, a design procedure is required 
which offers more flexibility than mixed-sensitivity H. control, but is not 
as complicated as p-synthesis. For simplicity, it should be based on classical 
loop-shaping ideas and it should not be limited in its applications like LTR 
procedures. In the next section, we present such a controller design procedure. 


9.4 HH. loop-shaping design 


The loop-shaping design procedure described in this section is based on Hoo 
robust stabilization combined with classical loop-shaping, as proposed by 
McFarlane and Glover (1990). It is essentially a two stage design process. 
First, the open-loop plant is augmented by pre and post-compensators to give 
a desired shape to the singular values of the open-loop frequency response. 
Then the resulting shaped plant is robustly stabilized using H. optimization. 
An important advantage is that no problem-dependent uncertainty modelling, 
or weight selection, is required in this second step. 

We will begin the section with a description of the H. robust stabilization 
problem (Glover and McFarlane, 1989). This is a particularly nice problem 
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because it does not require y-iteration for its solution, and explicit formulae 
for the corresponding controllers are available. The formulae are relatively 
simple and so will be presented in full. 

Following this, a step by step procedure for H.» loop-shaping design is 
presented. This systematic procedure has its origin in the Ph.D. thesis of 
Hyde (1991) and has since been successfully applied to several industrial 
problems. The procedure synthesizes what is in effect a single degree-of- 
freedom controller. This can be a limitation if there are stringent requirements 
on command following. However, as shown by Limebeer, Kasenally and 
Perkins (1993), the procedure can be extended by introducing a second degree- 
of-freedom in the controller and formulating a standard H,., optimization 
problem which allows one to trade off robust stabilization against closed-loop 
model-matching. We will describe this two degrees-of-freedom extension and 
further show that such controllers have a special observer-based structure 
which can be taken advantage of in controller implementation. 


9.4.1 Robust stabilization 


For multivariable systems, classical gain and phase margins are unreliable 
indicators of robust stability when defined for each channel (or loop), taken 
one at a time, because simultaneous perturbations in more than one loop are 
not then catered for. More general perturbations like diag{k;} and diag {e?” i, 
as discussed in section 9.2.2, are required to capture the uncertainty, but even 
these are limited. It is now common practice, as seen in Chapter 8, to model 
uncertainty by norm-bounded dynamic matrix perturbations. Robustness 
levels can then be quantified in terms of the maximum singular values of 
various closed-loop transfer functions. 

The associated robustness tests, for a single perturbation, as described in 
Chapter 8, require the perturbation to be stable. This restricts the plant 
and perturbed plant models to have the same number of unstable poles. To 
overcome this, two stable perturbations can be used, one on each of the factors 
in a coprime factorization of the plant, as shown in section 8.6.3. Although this 
uncertainty description seems unrealistic and less intuitive than the others, 
it is in fact quite general, and for our purposes it leads to a very useful Ho 
robust stabilization problem. Before presenting the problem, we will first recall 
the uncertainty model given in (8.62). 

We will consider the stabilization of a plant G which has a normalized left 
coprime factorization (as discussed in Section 4.1.5) 


G=M!N (9.62) 


where we have dropped the subscript from M and N for simplicity. A 
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Figure 9.16: 7. robust stabilization problem 


perturbed plant model G, can then be written as 
Gp = (M+ Ay)71(N + An) (9.63) 


where Ay, An are stable unknown transfer functions which represent 
the uncertainty in the nominal plant model G. The objective of robust 
stabilization it to stabilize not only the nominal model G, but a family of 
perturbed plants defined by 


Gp = {(M + Am) "(N+ An): ||[ An Am ] lloo <e} (9.64) 


where € > 0 is then the stability margin. To maximize this stability margin 
is the problem of robust stabilization of normalized coprime factor plant 
descriptions as introduced and solved by Glover and McFarlane (1989). 

For the perturbed feedback system of Figure 9.16, as already derived in 
(8.64), the stability property is robust if and only if the nominal feedback 
system is stable and 


il 
<= 
€ 


A l| K | (P=6K)r yw! (0.65) 


| [o.e) 


Notice that ¥ is the H.. norm from ¢ to (*) and (I-GK)~" is the sensitivity 


function for this positive feedback arrangement. 
The lowest achievable value of y and the corresponding maximum stability 
margin € are given by Glover and McFarlane (1989) as 


1 


Ymin = Enns = {1 (IN Mir} “ =(.+0(X2Z))# (9.66) 


where || - ||77 denotes Hankel norm, p denotes the spectral radius (maximum 
eigenvalue), and for a minimal state-space realization (A,B,C, D) of G, Z is 
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the unique positive definite solution to the algebraic Riccati equation 


(A— BS-'D?C)Z + Z(A— BS-'DTC)? — ZCTR'CZ+ BS ‘BT =0 
(9.67) 
where 
R=I+DD", S=I1+D"D 


and X is the unique positive definite solution of the following algebraic Riccati 
equation 


(A—BS-1DTC)TX + X(A— BS-1D7C) — XBS"'B7X +CTR'C =0 
(9.68) 
Notice that the formulae simplify considerable for a strictly proper plant, i.e. 
when D = 0. 
A controller (the ‘central’ controller in McFarlane and Glover (1990)) which 
guarantees that 


_ —1yf-1 
ee 000) 
for a specified 7 > Ymin, is given by 
s [A+ BF4+7°(L7) 1207 (C4 DF) | (£7)712ZCT 
Kk 2 | BIX | I (9.70) 
F = -—§1(D?'C+BTX) (9.71) 
L= (@-Y)I+XxzZ. (9.72) 


The MATLAB function coprimeunc, listed in Table 9.2, can be used to 
generate the controller in (9.70). It is important to emphasize that since we 
can compute Ymin from (9.66) we get an explicit solution by solving just two 
Riccati equations (aresolv) and avoid the 7-iteration needed to solve the 
general H.. problem. 


Remark 1 An example of the use of coprimeunc is given in Example 9.3 below. 


Remark 2 Notice that, if y = Ymin in (9.70), then L = —p(XZ)I 4+ XZ, 
which is singular, and thus (9.70) cannot be implemented. If for some unusual 
reason the truly optimal controller is required, then this problem can be resolved 
using a descriptor system approach, the details of which can be found in (Chiang 
and Safonov, 1992; Safonov, Jonckheere, Verma and Limebeer, 1987; Safonov 
et al., 1989). 


Remark 3 Alternatively, from Glover and McFarlane (1989), all controllers 
achieving y = ymin are given by K = UV~', where U and V are stable, (U,V) 
is a right coprime factorization of K, and U,V satisfy 


| ge 4 u || =|IN Min (9.73) 
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Table 9.2: MATLAB function to generate the H.. controller in (9.70). 


i, Uses the Robust control or Mu toolbox 
function [Ac,Bc,Cc,Dc, gammin]=coprimeunc(a,b,c,d,gamrel) 
h 
% Finds the controller which optimally robustifies a given shaped plant 
% in terms of tolerating maximum coprime uncertainty. 
h 
% INPUTS: 
% a,b,c,d: State-space description of (shaped) plant. 
% gamrel: gamma used is gamrel*gammin (typical gamrel=1.1) 


%, OUTPUTS: 
% Ac,Bc,Cc,Dce: "Robustifying" controller (positive feedback). 


S=eye(size(d’*d) )+d’ *d; 

R=eeye(size(d*d’))+d*d’ ; 

Al=a-b*inv(S)*d’?*c; 

Qi=c’ *inv(R)*c; 

Ri=b*inv(S)*b’; 

[x1,x2,eig,xerr,wellposed,X] = aresolv(A1,Q1,R1); 


% Alt. Mu toolbox: [x1,x2, fail, reig_-min] = ric_schr([A1 -R1; -Q1 -A1’]); X = x2/x1; 


[x1,x2,eig,xerr,wellposed,Z] = aresolv(A1’,R1,Q1); 


% Alt. Mu toolbox: [x1, x2, fail, reig-min] = ric_schr([A1’ -Q1; -R1 -A1]); Z = x2/x1; 


% Optimal gamma: 
gammin=sqrt (1+max (eig (X*Z))) 

% Use higher gamma..... 

gam = gamrel*gammin; 

L=(1-gam*gam) *eye(size(X*Z)) + X*Z; 
F=-inv (S)*(d? *c+b? *X) ; 
Ac=atb*Ft+gam*gam*inv(L’ )*Z*c’ *(c+d*F) ; 
Bc=gam*gam*inv (L’ )*Z*c’ ; 

Cc=b’ *X; 

Dc=-d’; 
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The determination of U and V is a Nehari extension problem: that is, a problem 
in which an unstable transfer function R(s) is approximated by a stable transfer 
function Q(s), such that ||R + Q||.o is minimized, the minimum being||R*||7. A 
solution to this problem is given in Glover (1984). 


Exercise 9.5 Formulate the Ho robust stabilization problem in the general control 
configuration of Figure 9.8, and determine a transfer function expression and a state- 
space realization for the generalized plant P. 


9.4.2 A systematic 1, loop-shaping design procedure 


Robust stabilization alone is not much use in practice because the designer is 
not able to specify any performance requirements. To do this McFarlane and 
Glover (1990) proposed pre- and post-compensating the plant to shape the 
open-loop singular values prior to robust stabilization of the “shaped” plant. 

If W, and W2 are the pre- and post-compensators respectively, then the 
shaped plant G's is given by 


G;=WGW, (9.74) 


as shown in Figure 9.17. The controller K, is synthesized by solving the 


Figure 9.17: The shaped plant and controller 


robust stabilization problem of section 9.4.1 for the shaped plant G, with a 
normalized left coprime factorization G; = M,;1Ns. The feedback controller 
for the plant G is then K = W,K,W2. The above procedure contains all the 
essential ingredients of classical loop-shaping, and can easily be implemented 
using the formulae already presented and reliable algorithms in, for example, 
MATLAB. 

We first present a simple SISO example, where W2 = 1 and we select W; to 
get acceptable disturbance rejection. We will afterwards present a systematic 
procedure for selecting the weights W, and Wp. 


Example 9.3 Glover-McFarlane H,. loop shaping for the disturbance 
process. Consider the disturbance process in (2.55) which was studied in detail 
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Figure 9.18: Glover-McFarlane loop-shaping design for the disturbance process. 
Dashed line: Initial “shaped” design, G. Solid line: “Robustified” design, G; Kk. 


in Chapter 2, 


200 1 Gals) = 200 
10s+1(0.05s+1)2’ "10841 


G(s) (9.75) 


We want as good disturbance rejection as possible, and the gain crossover frequency 
We for the final design should be about 10 [rad/s]. 

In Example 2.8 we argued that for acceptable disturbance rejection with minimum 
input usage, the loop shape (“shaped plant”) should be similar to |Ga|. Neglecting the 
high-frequency dynamics in G(s) this yields an initial weight W; = 0.5. To improve 
the performance at low frequencies we add integral action, and we also add a phase- 
advance term s + 2 to reduce the slope for L from —2 at lower frequencies to about 
—1 at crossover. Finally, to make the response a little faster we multiply the gain by 


a factor 2 to get the weight 
2 
w= (9.76) 


This yields a shaped plant G; = GW, with a gain crossover frequency of 13.7 [rad/s], 
and the magnitude of G(jw) is shown by the dashed line in Figure 9.18 (a). The 
response to a unit step in the disturbance response is shown by the dashed line in 
Figure 9.18 (b), and, as may expected, the response with the “controller” K = W, 
is too oscillatory. 

We now “robustify” this design so that the shaped plant tolerates as much Heo 
coprime factor uncertainty as possible. This may be done with the MATLAB u- 
toolbor using the command ncfsyn or with the MATLAB Robust Control toolbox 
using the function coprimeunc given in Table 9.2: 


[Ac,Bc,Cc,Dc, gammin]=coprimeunc(A,B,C,D,gamrel) 


Here the shaped plant Gs = GW has state-space matrices A, B,C and D, and the 
function returns the “robustifying” positive feedback controller K, with state-space 
matrices Ac, Bc,Cc and Dc. In general, K, has the same number of poles (states) 
as G;. gamrel is the value of y relative to ymin, and was in our case selected as 
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1.1. The returned variable gammin (ymin ) is the inverse of the magnitude of coprime 
uncertainty we can tolerate before we get instability. We want ymin > 1 as small 
as possible, and we usually require that ymin is less than 4, corresponding to 25% 
allowed coprime uncertainty. 

Applying this to our example gives ymin = 2.34 and an overall controller K = 
WK, with 5 states (G;, and thus K;, has 4 states, and W has 1 state). The 
corresponding loop shape |G;K;,| is shown by the solid line in Figure 9.18 (a). We 
see that the change in the loop shape is small, and we note with interest that the 
slope around crossover is somewhat gentler. This translates into better margins: The 
gain margin (GM) is improved from 1.62 (for G;) to 3.48 (forG, Kz), and the phase 
margin (PM) is improved from 13.2° to 51.5°. The gain crossover frequency we is 
reduced slightly from 18.7 to 10.8 [rad/s]. The corresponding disturbance response is 
shown in Figure 9.18 (b) and is seen to be much improved. 


Remark. The response with the controller K = WK, is quite similar to that 
the loop-shaping controller K3(s) designed in Chapter 2 (see curves Lz and yz in 
Figure 2.21). The response for reference tracking with controller K = W1Kg is not 
shown, it is also very similar to that with Kz (see Figure 2.23), but it has a slightly 
smaller overshoot of 21% rather than 24%. To reduce this overshoot we would need 
to use a two degrees-of-freedom controller. 


Exercise 9.6 Design an Hoo loop-shaping controller for the disturbance process in 
(9.75) using the weight Wi in (9.76), t.e.generate the plots corresponding to those in 
Figure 9.18. Next, repeat the design with Wi = 2(s+3)/s (which results in an initial 
G, which would yield closed-loop instability with K. = 1). Compute the gain and 
phase margins and compare the disturbance and reference responses. In both cases 
find we and use (2.38) to compute the maximum delay that can be tolerated in the 
plant before instability arises. 


Skill is required in the selection of the weights (pre- and post-compensators 
W, and W2), but experience on real applications has shown that robust 
controllers can be designed with relatively little effort by following a few simple 
rules. An excellent illustration of this is given in the thesis of Hyde (1991) 
who worked with Glover on the robust control of VSTOL (vertical and/or 
short take-off and landing) aircraft. Their work culminated in a successful 
flight test of H,. loop-shaping control laws implemented on a Harrier jump- 
jet research vehicle at the UK Defence Research Agency, Bedford in 1993. The 
H.« loop-shaping procedure has also been extensively studied and worked on 
by Postlethwaite and Walker (1992) in their work on advanced control of high 
performance helicopters, also for the UK DRA at Bedford. This application 
is discussed in detail in the helicopter case study in Section 12.2. 

Based on these, and other studies, it is recommended that the following 
systematic procedure is followed when using H. loop-shaping design: 


1. Scale the plant outputs and inputs. This is very important for most design 
procedures and is sometimes forgotten. In general, scaling improves the 
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conditioning of the design problem, it enables meaningful analysis to be 
made of the robustness properties of the feedback system in the frequency 
domain, and for loop-shaping it can simplify the selection of weights. There 
are a variety of methods available including normalization with respect to 
the magnitude of the maximum or average value of the signal in question. 
Scaling with respect to maximum values is important if the controllability 
analysis of earlier chapters is to be used. However, if one is to go straight 
to a design the following variation has proved useful in practice: 


(a) The outputs are scaled such that equal magnitudes of cross-coupling 
into each of the outputs is equally undesirable. 

(b) Each input is scaled by a given percentage (say 10%) of its expected 
range of operation. That is, the inputs are scaled to reflect the relative 
actuator capabilities. An example of this type of scaling is given in the 
aero-engine case study of Chapter 12. 


2. Order the inputs and outputs so that the plant is as diagonal as possible. 
The relative gain array can be useful here. The purpose of this pseudo- 
diagonalization is to ease the design of the pre- and post-compensators 
which, for simplicity, will be chosen to be diagonal. 


Next, we discuss the selection of weights to obtain the shaped plant G, = 
W2GW, where 
W, = W,W.W, (9.77) 


3. Select the elements of diagonal pre- and post-compensators W, and W2 so 
that the singular values of W2G'W, are desirable. This would normally 
mean high gain at low frequencies, roll-off rates of approximately 20 
dB/decade (a slope of about —1) at the desired bandwidth(s), with higher 
rates at high frequencies. Some trial and error is involved here. W2 is usually 
chosen as a constant, reflecting the relative importance of the outputs to be 
controlled and the other measurements being fed back to the controller. For 
example, if there are feedback measurements of two outputs to be controlled 
and a velocity signal, then W2 might be chosen to be diag/1, 1, 0.1], where 
0.1 is in the velocity signal channel. W, contains the dynamic shaping. 
Integral action, for low frequency performance; phase-advance for reducing 
the roll-off rates at crossover; and phase-lag to increase the roll-off rates at 
high frequencies should all be placed in W, if desired. The weights should 
be chosen so that no unstable hidden modes are created in G',. 

4. Optional: Align the singular values at a desired bandwidth using a further 
constant weight W, cascaded with W,. This is effectively a constant 
decoupler and should not be used if the plant is ill-conditioned in 
terms of large RGA elements (see Section 6.10.4). The align algorithm 
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of Kouvaritakis (1974) which has been implemented in the MATLAB 
Multivariable Frequency-Domain Toolbox is recommended. 


. Optional: Introduce an additional gain matrix W, cascaded with W, to 


provide control over actuator usage. W, is diagonal and is adjusted so that 
actuator rate limits are not exceeded for reference demands and typical 
disturbances on the scaled plant outputs. This requires some trial and 
error. 


. Robustly stabilize the shaped plant G; = W2GW 1, where W; = W,W.W,, 


using the formulae of the previous section. First, calculate the maximum 
stability margin €max = 1/Ymin- If the margin is too small, €max < 0.25, then 
go back to step 4 and modify the weights. Otherwise, select y > min, by 
about 10%, and synthesize a suboptimal controller using equation (9.70). 
There is usually no advantage to be gained by using the optimal controller. 
When €max > 0.25 (respectively ymin < 4) the design is usually successful. 
In this case, at least 25% coprime uncertainty is allowed, and we also 
find that the shape of the open-loop singular values will not have changed 
much after robust stabilization. A small value of emax indicates that the 
chosen singular value loop-shapes are incompatible with robust stability 
requirements. That the loop-shapes do not change much following robust 
stabilization if 7 is small (e€ large), is justified theoretically in McFarlane 
and Glover (1990). 


. Analyze the design and if all the specifications are not met make further 


modifications to the weights. 


. Implement the controller. The configuration shown in Figure 9.19 has been 


found useful when compared with the conventional set up in Figure 9.1. 
This is because the references do not directly excite the dynamics of K,,, 


Figure 9.19: A practical implementation of the loop-shaping controller 


which can result in large amounts of overshoot (classical derivative kick). 
The constant prefilter ensures a steady state gain of 1 between r and y, 
assuming integral action in W, or G. 


We will conclude this subsection with a summary of the advantages offered 
by the above H.. loop-shaping design procedure: 
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e It is relatively easy to use, being based on classical loop-shaping ideas. 

e There exists a closed formula for the H.. optimal cost ymin, which in turn 
corresponds to a maximum stability margin emax = 1/7min- 

e No 4-iteration is required in the solution. 

e Except for special systems, ones with all-pass factors, there are no pole- 
zero cancellations between the plant and controller (Sefton and Glover, 
1990; Tsai, Geddes and Postlethwaite, 1992). Pole-zeros cancellations are 
common in many H.. control problems and are a problem when the plant 
has lightly damped modes. 


Exercise 9.7 First a definition and some useful properties. 

Definition: A stable transfer function matriz H(s) is inner if H*H = TI, and 
co-inner if HH* =T. 

Properties: The Ho norm is invariant under right multiplication by a co-inner 
function and under left multiplication by an inner function. 

Equipped with the above definition and properties, show for the shaped Gs = 
M;'N,, that the matrix [ M, Nz ] is co-inner and hence that the Hoo loop- 
shaping cost function 


il Ks (I —GsKs)~'M;' (9.78) 


I 


J 
is equivalent to 


o “ 
where S, = (I —G;K,)~'. This shows that the ies of finding a stabilizing 
controller to minimise 4-block cost function (9.79) has an exact solution. 


Whilst it is highly desirable, from a computational point of view, to have 
exact solutions for Ho. optimization problems, such problems are rare. We 
are fortunate that the above robust stabilization problem is also one of great 
practical significance. 


9.4.3. Two degrees-of-freedom controllers 


Many control design problems possess two degrees-of-freedom: on the one 
hand, measurement or feedback signals and on the other, commands or 
references. Sometimes, one degree-of-freedom is left out of the design, and 
the controller is driven (for example) by an error signal i.e. the difference 
between a command and the output. But in cases where stringent time- 
domain specifications are set on the output response, a one degree-of-freedom 
structure may not be sufficient. A general two degrees-of-freedom feedback 
control scheme is depicted in Figure 9.20. The commands and feedbacks enter 
the controller separately and are independently processed. 
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Controller 


Figure 9.20: General two degrees-of-freedom feedback control scheme 


The H.» loop-shaping design procedure of McFarlane and Glover is a one 
degree-of-freedom design, although as we showed in Figure 9.19 a simple 
constant prefilter can easily be implemented for steady-state accuracy. For 
many tracking problems, however, this will not be sufficient and a dynamic two 
degrees-of-freedom design is required. In (Hoyle, Hyde and Limebeer, 1991) 
and (Limebeer et al., 1993) a two degrees-of-freedom extension of the Glover- 
McFarlane procedure was proposed to enhance the model-matching properties 
of the closed-loop. With this the feedback part of the controller is designed 
to meet robust stability and disturbance rejection requirements in a manner 
similar to the one degree-of-freedom loop-shaping design procedure except 
that only a precompensator weight W is used. It is assumed that the 
measured outputs and the outputs to be controlled are the same although 
this assumption can be removed as shown later. An additional prefilter part 
of the controller is then introduced to force the response of the closed-loop 
system to follow that of a specified model, Tyep, often called the reference 
model. Both parts of the controller are synthesized by solving the design 
problem illustrated in Figure 9.21. 


Figure 9.21: Two degrees-of-freedom 71., loop-shaping design problem 


The design problem is to find the stabilizing controller K = [ ky Ke ] 
for the shaped plant G; = GW,, with a normalized coprime factorization 
G, = M;1Nz, which minimizes the Ho norm of the transfer function between 


the signals [ rl ot it and [ ul yt eF ile as defined in Figure 9.21. 
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The problem is easily cast into the general control configuration and solved 
suboptimally using standard methods and y-iteration. We will show this later. 
The control signal to the shaped plant u, is given by 


Us = [Ki Ke | Bi (9.80) 
y 

where Ky is the prefilter, Ka is the feedback controller, 3 is the scaled 
reference, and y is the measured output. The purpose of the prefilter is to 
ensure that 

|(Z — G.K2)'G.Ky — Tretlloo < yp? (9.81) 
Trep is the desired closed-loop transfer function selected by the designer to 
introduce time-domain specifications (desired response characteristics) into 
the design process; and p is a scalar parameter that the designer can increase 
to place more emphasis on model matching in the optimization at the expense 
of robustness. 

From Figure 9.21 and a little bit of algebra, we have that 


Us p(T — KoG s) aa Ko(I —G,K2)1M;1 

y | =| pI-G,K2)G,K1 (I —G,K)-1M> . 

7 p? [(I —GeK2) "GsK1 — Trt] p(E — Geo) M>? @ 
(9.82) 


In the optimization, the H. norm of this block matrix transfer function is 
minimized. 

Notice that the (1,2) and (2,2) blocks taken together are associated with 
robust stabilization and the (3,1) block corresponds to model-matching. In 
addition, the (1,1) and (2,1) blocks help to limit actuator usage and the 
(3,3) block is linked to the robustness of the loop. For p = 0, the problem 
reverts to minimizing the #1. norm of the transfer function between ¢ and 
[ uly? i namely, the robust stabilization problem, and the two degrees- 
of-freedom controller reduces to an ordinary H.. loop-shaping controller. 

To put the two degrees-of-freedom design problem into the standard control 
configuration, we can define a generalized plant P by 


Us 
é Py P : 
11 12 
Hale | ee ea | ot vee 
B Us 
y 
0 0 | I 
0 M;} i Gs r 
= | —e'Tre pMz' 1 pGs || ¢- (9.84) 
a ne oe ee tere 
0 M;1 ' Gs 
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Further, if the shaped plant G, and the desired stable closed-loop transfer 
function T,er have the following state-space realizations 


$ A, | Bs 
G, = CG, 1D, | (9.85) 
$ A, B, 
Tret = | C. D, | (9.86) 
then P may be realized by 
at 0 O> GDF + Z,CT R71 B, 
0 A, B, 0 | 0 
0 0 0 0 I 
Cc, 0 0 7 ' Ds |(9.87) 
pC, —p’C, | —p?D, Ree ! pDs 
0 0 pl 0 | 0 
GC, 0 0 Ri? | Ds 


and used in standard H., algorithms (Doyle et al., 1989) to synthesize the 
controller K. Note that R, = I+ D,DT, and Z, is the unique positive 
definite solution to the generalized Riccati equation (9.67) for G,. MATLAB 
commands to synthesize the controller are given in Table 9.3. 


Remark 1 We stress that we here aim at minimizing the 1. norm of the entire 
transfer function in (9.82). An alternative problem would be to minimize the Ho 
norm form r to e subject to an upper bound on ||[An, Ax, | |loo. This problem 
would involve the structured singular value, and the optimal controller could be 
obtained from solving a series of 7{..optimization problems using D K-iteration, see 
Section 8.12. 


Remark 2 Extra measurements. In some cases, a designer has more plant 
outputs available as measurements than have to be controlled. These extra 
measurements can often make the design problem easier (e.g. velocity feedback) and 
therefore when beneficial should be used by the feedback controller K2. This can be 
accommodated in the two degrees-of-freedom design procedure by introducing an 
output selection matrix W,. This matrix selects from the output measurements y 
only those which are to be controlled and hence included in the model-matching part 
of the optimization. In Figure 9.21, W. is introduced between y and the summing 
junction. In the optimization problem, only the equation for the error e is affected, 
and in the realization (9.87) for P one simply replaces pC, by pW.C; and pRi? by 
pW.Ri? in the fifth row. For example, if there are four feedback measurements and 
only the first three are to be controlled then 


100 0 
Wo=|0 1 0 0 (9.88) 
001 0 
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Table 9.3: MATLAB commands to synthesize the H.. 2-DOF controller 


in (9.80) 

%, Uses MATLAB mu toolbox 

h 

% INPUTS: Shaped plant Gs 

vA Reference model Tref 

h 

¥, OUTPUT: Two degrees-of-freedom controller K 
h 

vA Coprime factorization of Gs 

h 


[As,Bs,Cs,Ds] = unpck(Gs); 

[Ar,Br,Cr,Dr] = unpck(Tref) ; 

[nr,nr] = size(Ar); [lr,mr] size(Dr); 

[ns,ns] = size(As); [ls,ms] size(Ds); 

Rs eye(1s)+Ds*Ds.’; Ss = eye(ms)+Ds’*Ds; 

At (As - Bs*inv(Ss)*Ds’*Cs); 

Ri Cs’*inv(Rs)*Cs; Qi = Bs*inv(Ss)*Bs’; 

[Z1, Z2, fail, reig min] = ric_schr([A1’ -R1; -Q1 -A1]); Zs = 22/Z1; 
% Alt. Robust toolbox: [Z1,Z2,eig,zerr,wellposed,Zs] = aresolv(A1’,Q1,R1); 
y 

h 

% Set rho to 1 and build the generalized plant P in (9.87) 


A = daug(As, Ar); 
Bi = [zeros(ns,mr) ((Bs*Ds’)+(Zs*Cs’))*inv(sqrt (Rs) ); 
Br zeros(nr,1s)]; 


B2 = [Bs;zeros(nr,ms)]; 

Ci = [zeros(ms,nstnr);Cs zeros(1s,nr);rho*Cs -rho*rho*Cr] ; 

C2 = [zeros(mr,nstnr);Cs zeros(1s,nr)]; 

Dii = [zeros(ms,mr+ls);zeros(1s,mr) sqrt(Rs);-rho*rho*Dr rho*sqrt (Rs) ]; 
D1i2 = [eye(ms) ;Ds;rho*Ds] ; 

D21 = [rhoxeye(mr) zeros(mr,ls);zeros(1s,mr) sqrt(Rs)]; 

D22 = [zeros(mr,ms);Ds]; 


B = [Bi B2]; C = [C1;C2]; D = [D11 D12;D21 D22]; 

P = pcek(A,B,C,D); 

% Alternative: Use sysic to generate P from Figure 9.21 

% but may get extra states, since states from Gs enter twice. 


¥, Gamma iterations to obtain H-infinity controller 


[11,m2] = size(D1i2); [12,m1] = size(D21); 

mmeas = 12; ncon = m2; gmin = 1; gmax = 5; gtol = 0.01; 

[K, Gnclp, gam] = hinfsyn(P, nmeas, ncon, gmin, gmax, gtol); 
% Alt. Robust toolbox: hinfopt 

y 

vA 
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Remark 3 Steady-state gain matching. The command signals r can be scaled 
by a constant matrix W; to make the closed-loop transfer function from r to the 
controlled outputs Woy match the desired model T;es exactly at steady-state. This 
is not guaranteed by the optimization which aims to minimize the oo-norm of the 
error. The required scaling is given by 


Wi & [WoT — Gs(0)K2(0))"1@(0) K1(0)] > 


: Trot (0) (9.89) 


Recall that W, = I if there are no extra feedback measurements above those that 
are to be controlled. The resulting controller is K = [ KiW; Ke |. 


We will conclude this subsection with a summary of the main steps required 
to synthesize a two degrees-of-freedom H,. loop-shaping controller. 


1. Design a one degree-of-freedom H. loop-shaping controller using the 
procedure of subsection 9.4.2, but without a post-compensator weight Wo. 
Hence W. 

2. Select a desired closed-loop transfer function Tier between the commands 
and controlled outputs. 

3. Set the scalar parameter p to a small value greater than 1; something in 
the range 1 to 3 will usually suffice. 

4. For the shaped plant G; = GW,, the desired response Tyer, and the scalar 
parameter p, solve the standard H. optimization problem defined by P 
in (9.87) to a specified tolerance to get K = [ K, Ky |. Remember to 
include W, in the problem formulation if extra feedback measurements are 
to be used. 

5. Replace the prefilter Ay, by KiW; to give exact model-matching in the 
steady-state. 

6. Analyse and, if required, redesign making adjustments to p and possibly 
W, and Tree. 


The final two degrees-of-freedom H..loop-shaping controller is illustrated 
in Figure 9.22 


Controller 


Figure 9.22: Two degrees-of-freedom 7{.loop-shaping controller 
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9.4.4 Observer-based structure for 7/,, loop-shaping 
controllers 


H.. designs exhibit a separation structure in the controller. As seen from 
(9.50) and (9.51) the controller has an observer/state feedback structure, 
but the observer is non-standard, having a disturbance term (a ‘worst’ 
disturbance) entering the observer state equations. For H. loop-shaping 
controllers, whether of the one or two degrees-of-freedom variety, this extra 
term is not present. The clear structure of these controllers has several 
advantages: 


e It is helpful in describing a controller’s function, especially to one’s 
managers or clients who may not be familiar with advanced control. 

e It lends itself to implementation in a gain-scheduled scheme, as shown by 
Hyde and Glover (1993). 

e It offers computational savings in digital implementations and some multi- 
mode switching schemes, as shown in (Samar, Postlethwaite, Gu, Murad 
and Choi, 1994). 


We will present the controller equations, for both one and two degrees- 
of-freedom H.. loop-shaping designs. For simplicity we will assume the 
shaped plant is strictly proper, with a stabilizable and detectable state-space 


realization A 
$ s B; 
o. $e (9.90) 


In which case, as shown in (Sefton and Glover, 1990), the single degree-of- 
freedom H. loop-shaping controller can be realized as an observer for the 
shaped plant plus a state-feedback control law. The equations are 


@, = As®,+H,(C.8,— Ys) + Botts (9.91) 
Us = Ks (9.92) 


where %, is the observer state, us and ys; are respectively the input and output 
of the shaped plant, and 


Heo=-=Z2CF (9.93) 
Ky SS SB eae 1G (9.94) 


where Z, and Xz, are the appropriate solutions to the generalized algebraic 
Riccati equations for G', given in (9.67) and (9.68). 

In Figure 9.23, an implementation of an observer-based H.. loop-shaping 
controller is shown in block diagram form. The same structure was used by 
Hyde and Glover (1993) in their VSTOL design which was scheduled as a 
function of aircraft forward speed. 
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y 


Figure 9.23: An implementation of an 7(., loop-shaping controller for use when 
gain scheduling against a variable v 


Walker (1994) has shown that the two degrees-of-freedom H,. loop-shaping 
controller also has an observer-based structure. He considers a stabilizable and 
detectable plant 


s | As | Bs 
o. 8 [ete (9.95) 
and a desired closed-loop transfer function 
s | Ar | B, 
Tret = | C10 | (9.96) 
in which case the generalized plant P(s) in (9.87) simplifies to 
Ag 0 0 Z,CT B, 
A B 
ps (9.97) 
& (9.98) 
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Walker then shows that a stabilizing controller K = [ Kk, Ke ] satisfying 
|Fi(P, K)\loo < ¥ exists if, and only if, 


(i) y > V1+p?, and 


(ii) Xo. > 0 is a solution to the algebraic Riccati equation 


X A+ ATX. + CTC, — FT (DT ID)F =0 (9.99 
such that Re 4; [A + BF] < 0 where 
Bo= (DP ID) (DIC +B xX) (9.100 
ay Dy Die 
D = Be ‘ (9.101 
= L, 0 
J = E | (9.102 


where [, and [,, are unit matrices of dimensions equal to those of the 
error signal z, and exogenous input w, respectively, in the standard 
configuration. 


Notice that this 71.. controller depends on the solution to just one algebraic 
Riccati equation, not two. This is a characteristic of the two degrees-of- 
freedom H., loop-shaping controller (Hoyle et al., 1991). 

Walker further shows that if (i) and (ii) are satisfied, then a stabilizing 
controller K(s) satisfying ||Fi(P, K)|loo <7 has the following equations: 


Z = As®s + Hs(Cs¥s — ys) + Botts (9.103) 
tp, = A,@,+ Br (9.104) 
us = —BIXou8,— BOX x18, (9.105) 


where X51; and X..12 are elements of 


Xoo1t Xo012 
Xo = 9.106 
| Xo021 X 022 ( ) 
which has been partitioned conformally with 
As 0 
A= 0 A, (9.107) 


and H, is as in (9.93). 
The structure of this controller is shown in Figure 9.24, where the state- 
feedback gain matrices F and F; are defined by 


FS BT Xeon FS BT X12 (9.108) 
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The controller consists of a state observer for the shaped plant G, a model 
of the desired closed-loop transfer function Tyrer (without C;.) and a state- 
feedback control law that uses both the observer and reference-model states. 


1 jnatudseashcednmecteceseuetncecaetdndempacetecesnsenssmmaacs output from 
H ‘ shaped plant 


Ys 


input to : 
shaped plant: 
Us } 


‘ commands 
ir 


Figure 9.24: Structure of the two degrees-of-freedom H.. loop-shaping controller 


As in the one degree-of-freedom case, this observer-based structure is useful 
in gain-scheduling. The reference-model part of the controller is also nice 
because it is often the same at different design operating points and so may 
not need to be changed at all during a scheduled operation of the controller. 
Likewise, parts of the observer may not change; for example, if the weight 
W,(s) is the same at all the design operating points. Therefore whilst the 
structure of the controller is comforting in the familiarity of its parts, it also 
has some significant advantages when it comes to implementation. 


9.4.5 Implementation issues 


Discrete-time controllers. For implementation purposes, discrete-time 
controllers are usually required. These can be obtained from a continuous- 
time design using a bilinear transformation from the s-domain to the z- 
domain, but there can be advantages in being able to design directly in 
discrete-time. In Samar et al. (1994; 1995) and Postlethwaite, Samar, Choi 
and Gu (1995), observer-based state-space equations are derived directly in 
discrete-time for the two degrees-of-freedom H.. loop-shaping controller and 
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successfully applied to an aero engine. This application was on a real engine, 
a Sey engine, which is a Rolls Royce 2-spool reheated turbofan housed at 
the UK Defence Research Agency, Pyestock. As this was a real application, a 
number of important implementation issues needed to be addressed. Although 
these are outside the general scope of this book, they will be briefly mentioned 
now. 

Anti-windup. In H. loop-shaping the pre-compensator weight W; would 
normally include integral action in order to reject low frequency disturbances 
acting on the system. However, in the case of actuator saturation the 
integrators continue to integrate their input and hence cause windup problems. 
An anti-windup scheme is therefore required on the weighting function W,. 
The approach we recommend is to implement the weight W in its self- 
conditioned or Hanus form. Let the weight W, have a realization 


(9.109) 


and let u be the input to the plant actuators and u, the input to the shaped 
plant. Then u = W,u,. When implemented in Hanus form, the expression for 
u becomes (Hanus, Kinnaert and Henrotte, 1987) 


U= 


Ae = Bede C4 ||| 0° Bede! Us 
om | a | | . (9.110) 


where wu, is the actual plant input, that is the measurement at the output 
of the actuators which therefore contains information about possible actuator 
saturation. The situation is illustrated in Figure 9.25, where the actuators 
are each modelled by a unit gain and a saturation. The Hanus form prevents 
windup by keeping the states of W, consistent with the actual plant input at 
all times. When there is no saturation u, = u, the dynamics of W, remain 
unaffected and (9.110) simplifies to (9.109). But when u, 4 u the dynamics 
are inverted and driven by ua so that the states remain consistent with the 
actual plant input ua. Notice that such an implementation requires W, to be 
invertible and minimum phase. 


Exercise 9.8 Show that the Hanus form of the weight W, in (9.110) simplifies to 
(9.109) when there is no saturation i.e. when Ua = wu. 


Bumpless transfer. In the aero-engine application, a multi-mode 
switched controller was designed. This consisted of three controllers, each 
designed for a different set of engine output variables, which were switched 
between depending on the most significant outputs at any given time. To 
ensure smooth transition from one controller to another - bumpless transfer - 
it was found useful to condition the reference models and the observers in each 
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actuator 
saturation 


Figure 9.25: Self-conditioned weight W; 


of the controllers. Thus when on-line, the observer state evolves according to 
an equation of the form (9.103) but when off-line the state equation becomes 


@ = Ash, + He(Co®s — ys) + Bettas (9.111) 


where was is the actual input to the shaped plant governed by the on-line 
controller. The reference model with state feedback given by (9.104) and 
(9.105) is not invertible and therefore cannot be self-conditioned. However, in 
discrete-time the optimal control also has a feed-through term from r which 
gives a reference model that can be inverted. Consequently, in the aero-engine 
example the reference models for the three controllers were each conditioned 
so that the inputs to the shaped plant from the off-line controller followed the 
actual shaped plant input was given by the on-line controller. 

Satisfactory solutions to implementation issues such as those discussed 
above are crucial if advanced control methods are to gain wider acceptance in 
industry. We have tried to demonstrate here that the observer-based structure 
of the H.. loop-shaping controller is helpful in this regard. 


9.5 Conclusions 


We have described several methods and techniques for controller design, but 
our emphasis has been on H... loop shaping which is easy to apply and in our 
experience works very well in practice. It combines classical loop-shaping ideas 
(familiar to most practising engineers) with an effective method for robustly 
stabilizing the feedback loop. For complex problems, such as unstable plants 
with multiple gain crossover frequencies, it may not be easy to decide on 
a desired loop shape. In which case, we would suggest doing an initial LQG 
design (with simple weights) and using the resulting loop shape as a reasonable 
one to aim for in H. loop shaping. 

An alternative to #1. loop shaping is a standard H,.. design with a 
“stacked” cost function such as in S/KS mixed-sensitivity optimization. In this 
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approach, 7. optimization is used to shape two or sometimes three closed- 
loop transfer functions. However, with more functions the shaping becomes 
increasingly difficult for the designer. 

In other design situations where there are several performance objectives 
(e.g. on signals, model following and model uncertainty), it may be more 
appropriate to follow a signal-based Hz or Hoapproach. But again the 
problem formulations become so complex that the designer has little direct 
influence on the design. 

After a design, the resulting controller should be analyzed with respect to 
robustness and tested by nonlinear simulation. For the former, we recommend 
i-analysis as discussed in Chapter 8, and if the design is not robust, then 
the weights will need modifying in a redesign. Sometimes one might consider 
synthesizing a p-optimal controller, but this complexity is rarely necessary in 
practice. Moreover, one should be careful about combining controller synthesis 
and analysis into a single step. The following quote from Rosenbrock (1974) 
illustrates the dilemma: 


In synthesis the designer specifies in detail the properties which 
his system must have, to the point where there is only one possible 
solution. .... The act of specifying the requirements in detail 
implies the final solution, yet has to be done in ignorance of this 
solution, which can out to be unsuitable in ways that were not 
foreseen. 


Therefore, control system design usually proceeds iteratively through the steps 
of modelling, control structure design, controllability analysis, performance 
weight selection, controller synthesis, control system analysis and nonlinear 
simulation. Rosenbrock (1974) makes the following observation: 


Solutions are constrained by so many requirements that it is 
virtually impossible to list them all. The designer finds himself 
threading a maze of such requirements, attempting to reconcile 
conflicting demands of cost, performance, easy maintenance, and 
so on. A good design usually has strong aesthetic appeal to those 
who are competent in the subject. 
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10 


CONTROL STRUCTURE 
DESIGN 


Most (if not all) available control theories assume that a control structure is given 
at the outset. They therefore fail to answer some basic questions which a control 
engineer regularly meets in practice: Which variables should be controlled. Which 
variables should be measured, which inputs should be manipulated, and which links 
should be made between them? The objective of this chapter is to describe the 
main issues involved in control structure design and to present some of the available 
quantitative methods, for example, for decentralized control. 


10.1 Introduction 


Control structure design was considered by Foss (1973) in his paper entitled 
“Critique of process control theory” where he concluded by challenging 
the control theoreticians of the day to close the gap between theory and 
applications in this important area. Later Morari, Arkun and Stephanopoulos 
(1980) presented an overview of control structure design, hierarchical control 
and multilevel optimization in their paper “Studies in the synthesis of control 
structure for chemical processes”, but the gap still remained, and still does to 
some extent today. 

Control structure design is clearly important in the chemical process 
industry because of the complexity of these plants, but the same issues are 
relevant in most other areas of control where we have large-scale systems. 
For example, in the late 1980s Carl Nett (Nett, 1989; Nett and Minto, 1989) 
gave a number of lectures based on his experience on aero-engine control at 
General Electric, under the title “A quantitative approach to the selection and 
partitioning of measurements and manipulations for the control of complex 
systems”. He noted that increases in controller complexity unnecessarily 
outpaces increases in plant complexity, and that the objective should be to 


... minimize control system complexity subject to the achievement 
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of accuracy specifications in the face of uncertainty. 


In Chapter 1 (page 1), we described the typical steps taken in the process 
of designing a control system. Steps 4, 5, 6 and 7 are associated with the 
following tasks of control structure design: 


1. The selection of controlled outputs (a set of variables which are to be 
controlled to achieve a set of specific objectives; see section 10.3). 

2. The selection of manipulations and measurements (sets of variables which 
can be manipulated and measured for control purposes; see section 10.4). 

3. The selection of a control configuration (a structure interconnecting 
measurements/commands and manipulated variables; see sections 10.6, 
10.7 and 10.8). 

4. The selection of a controller type (control law specification, e.g. PID- 
controller, decoupler, LQG, etc). 


The distinction between the words control structure and control 
configuration may seem minor, but note that it is significant within the context 
of this book. The control structure (or control strategy) refers to all structural 
decisions included in the design of a control system. On the other hand, the 
control configuration refers only to the structuring (decomposition) of the 
controller K itself (also called the measurement/manipulation partitioning 
or input/output pairing). Control configuration issues are discussed in more 
detail in Section 10.6. 

The selection of controlled outputs, manipulations and measurements (tasks 
1 and 2 combined) is sometimes called input/output selection. 

Ideally, the tasks involved in designing a complete control system are 
performed sequentially; first a “top-down” selection of controlled outputs, 
measurements and inputs (with little regard to the configuration of the 
controller A’) and then a “bottom-up” design of the control system (in which 
the selection of the control configuration is the most important decision). 
However, in practice the tasks are closely related in that one decision directly 
influences the others, so the procedure may involve iteration. 

One important reason for decomposing the control system into a specific 
control configuration is that it may allow for simple tuning of the 
subcontrollers without the need for a detailed plant model describing the 
dynamics and interactions in the process. Multivariable centralized controllers 
may always outperform decomposed (decentralized) controllers, but this 
performance gain must be traded off against the cost of obtaining and 
maintaining a sufficiently detailed plant model. 

The number of possible control structures shows a combinatorial growth, 
so for most systems a careful evaluation of all alternative control structures 
is impractical. Fortunately, we can often from physical insight obtain a 
reasonable choice of controlled outputs, measurements and manipulated 
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inputs. In other cases, simple controllability measures as presented in Chapters 
5 and 6 may be used for quickly evaluating or screening alternative control 
structures. 

Some discussion on control structure design in the process industry is given 
by Morari (1982), Shinskey (1988), Stephanopoulos (1989) and Balchen and 
Mumme (1988). A survey on control structure design is given by van de Wal 
and de Jager (1995). The reader is referred to Chapter 5 (page 170) for an 
overview of the literature on input-output controllability analysis. 


10.2. Optimization and control 


The selection of controlled outputs (task 1) involves selecting the variables 
y to be controlled at given reference values, y ~ r. Here the reference value 
r is set at some higher layer in the control hierarchy. Thus, the selection of 
controlled outputs (for the control layer) is usually intimately related to the 
hierarchical structuring of the control system which is often divided into two 
layers: 


e optimization layer — computes the desired reference commands r 
e control layer — implements these commands to achieve y  r. 


Additional layers are possible, as is illustrated in Figure 10.1 which shows 
a typical control hierarchy for a complete chemical plant. Here the control 
layer is subdivided into two layers: supervisory control (“advanced control” ) 
and regulatory control (“base control”). We have also included a scheduling 
layer above the optimization. In general, the information flow in such a control 
hierarchy is based on the higher layer sending reference values (setpoints) to 
the layer below, and the lower layer reporting back any problems in achieving 
this, see Figure 10.2(b). 

The optimization tends to be performed open-loop with limited use of 
feedback. On the other hand, the control layer is mainly based on feedback 
information. The optimization is often based on nonlinear steady-state models, 
whereas we often use linear dynamic models in the control layer (as used 
throughout the book). 

There is usually a time scale separation with faster lower layers as indicated 
in Figure 10.1. This means that the setpoints, as viewed from a given layer 
in the hierarchy, are updated only periodically. Between these updates, when 
the setpoints are constant, it is important that the system remains reasonably 
close to its optimum. This observation is the basis for Section 10.3 which deals 
with selecting outputs for the control layer. 

From a theoretical point of view, the optimal coordination of the inputs 
and thus the optimal performance is obtained with a centralized optimizing 
controller, which combines the two layers of optimization and control; see 
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Figure 10.1: Typical control system hierarchy in a chemical plant 
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Controller 


Ym 


(a) (b) (c) 


Figure 10.2: Alternative structures for optimization and control. (a) Open-loop 
optimization. (b) Closed-loop implementation with separate control layer. (c) 
Integrated optimization and control. 
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Figure 10.2(c). All control actions in such an ideal control system would 
be perfectly coordinated and the control system would use on-line dynamic 
optimization based on a nonlinear dynamic model of the complete plant 
instead of, for example, infrequent steady-state optimization. However, this 
solution is normally not used for a number of reasons; including the cost of 
modelling, the difficulty of controller design, maintenance and modification, 
robustness problems, operator acceptance, and the lack of computing power. 

As noted above we may also decompose the control layer, and from now 
on when we talk about control configurations, hierarchical decomposition and 
decentralization, we generally refer to the control layer. We will return to this 
topic in Section 10.6. 

Mesarovic (1970) reviews some ideas related to on-line multi-layer 
structures applied to large-scale industrial complexes. However, according 
to Lunze (1992), multilayer structures, although often used empirically in 
practice, are so far lacking a formal analytical treatment. Nevertheless, in the 
next section we provide some ideas on how to select objectives (controlled 
outputs) for the control layer, such that the overall goal is satisfied. 


Remark. In accordance with Lunze (1992) we have purposely used the word layer 
rather than level for the hierarchical decomposition of the control system. The 
difference is that in a multilevel system all units contribute to satisfying the same 
goal, whereas in a multilayer system the different units have different objectives 
(which preferably contribute to the overall goal). Multilevel systems have been 
studied in connection with the solution of optimization problems. 


10.3 Selection of controlled outputs 


A controlled output is an output variable (usually measured) with an 
associated control objective (usually a reference value). In many cases, it 
is clear from a physical understanding of the process what the controlled 
outputs should be. For example, if we consider heating or cooling a room, 
then we shall select room temperature as the controlled output y. In other 
cases it is less obvious because each control objective may not be associated 
with a measured output variable. Then the controlled outputs y are selected to 
achieve the overall system goal (purpose), and may not appear to be important 
variables in themselves. Also the selection of controlled outputs is usually 
related to a hierarchical decomposition of the control system; typically into 
an optimization and control layer. 


Example 10.1 Cake baking. To get an idea of the issues involved in output 
selection let us consider the process of baking a cake. The overall goal is to make a 
cake which is well baked inside and with a nice exterior. The manipulated input for 
achieving this is the heat input, u=Q, (and we will assume that the duration of the 
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baking is fired, e.g. at 15 minutes). Now, if we had never baked a cake before, and if 
we were to construct the stove ourselves, we might consider directly manipulating the 
heat input to the stove, possibly with a watt-meter measurement. However, this open- 
loop implementation would not work well, as the optimal heat input depends strongly 
on the particular oven we use, and the operation is also sensitive to disturbances, for 
example, from opening the oven door or what else might be in the oven Therefore, in 
practice we look up the optimal oven temperature in a cook book, and use a closed- 
loop implementation where a thermostat is used to keep the temperature y at its 
predetermined value T. 

The (a) open-loop and (b) closed-loop implementations of the cake baking process 
are illustrated in Figure 10.2. In (b) the “optimizer” is the cook book which has 
a pre-computed table of the optimal temperature profile. The reference value r for 
temperature is then sent down to the control layer which consists of a simple feedback 
controller (the thermostat). 


Recall that the title of this section is selection of controlled outputs. In the 
cake baking process we select oven temperature as the controlled output y in 
the control layer. It is interesting to note that controlling the oven temperature 
in itself has no direct relation to the overall goal of making a well-baked 
cake. So why do we select the oven temperature as a controlled output? We 
now want to outline an approach for answering questions of this kind. The 
mathematical development is kept mostly at a conceptual level. 

In the following, we let y denote the selected controlled outputs in the 
control layer. Note that this may also include directly using the inputs (open- 
loop implementation) by selecting y = u. Two distinct questions arise: 


1. What variables y should be selected as the controlled variables? 
2. What is the optimal reference value (Yopt) for these variables? 


The second problem is one of dynamic optimization and is extensively studied. 
Here we want to gain some insight into the first problem. We make the 
following assumptions: 


(a) The overall goal can be quantified in terms of a scalar cost function J 
which can be minimized. 

(b) For a given disturbance d there exists an optimal value uopt(d), and 
corresponding value Yopt (d), which minimizes the cost function J. 

(c) The reference values r for the controlled outputs y should be constant, 
i.e. r should be independent of the disturbances d. Typically, some 


average value is selected, e.g. r = Yopt(d) 


For example, in the cake baking process we may assign to each cake a 
number P on a scale from 0 to 10, based on cake quality. A perfectly baked 
cake achieves P = 10, and an acceptably baked cake achieves P > 6 (a 
completely burned cake may correspond to P = 1). In another case P could 
be the operating profit. In both cases we can select J = —P, and the overall 
goal of the control system is then to minimize J. 
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The system behaviour is a function of the independent variables u and d, 
so we may write J = J(u,d). For a given disturbance d the optimal value of 
the cost function is 


Jopr(d) & I(ttopt,d) = min J(u, d) (10.1) 


Ideally, we want u = Uopt. However, this will not be achieved in practice, and 
we select controlled outputs y such that: 


e The input u (generated by feedback to achieve y xr) should be close to the 
optimal input Uopt(d). 


Note that we have assumed that r is independent of d. The above statement 
is obvious, but it is nevertheless very useful. The following development aims 
at quantifying the statement. 

What happens if wu # uUopt? Obviously, we then have a loss which can be 
quantified by J — Jopt, and a reasonable objective for selecting controlled 
outputs y is to minimize the worst-case loss 


Worst — case loss : @ 2 max | J(u, d) — J(Uopt, )| (10.2) 
Here D is the set of possible disturbances. As “disturbances” we should also 
include changes in operating point and model uncertainty. To obtain some 
insight into the problem of minimizing the loss ®, let us consider the term 
J(u, d) — Jopt (d) in (10.2), where d is a fixed (generally nonzero) disturbance. 
We make the following additional assumptions: 


(d) The cost function J is smooth, or more precisely twice differentiable. 

(e) The optimization problem is unconstrained. If it is optimal to keep some 
variable at a constraint, then we assume that this is implemented and 
consider the remaining unconstrained problem. 

(f) The dynamics of the problem can be neglected, that is, we consider the 
steady-state control and optimization. 


For a fixed d we may then express J(u, d) in terms of a Taylor series expansion 
of u around the optimal point. We get 


OJ\* 1 Ord 
J(u, d) = Jopt t (Foul Uopt) t 5 iu Uopt)” (52). (u Uopt) Pais (10.3) 


=0 


where we have neglected terms of third order and higher (which assumes 

that we are reasonably close to the optimum). Also, note here that uopt is 
a function of d so we should formally write uop:(d). The second term on the 
right hand side in (10.3) is zero at the optimal point for an unconstrained 
problem. 
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Equation (10.3) quantifies how u — uop affects the cost function. To study 
how this relates to output selection we use a linearized model of the plant, 
which for a fixed d becomes y — Yopt = G(u—Uopt) where G is the steady-state 
gain matrix. If G is invertible we then get 


U— Uopt = Gy = Yopt ) (10.4) 


(if G is not invertible we may use the pseudo-inverse Gt which results in the 
smallest possible ||u — uope||2 for a given y — Yopt), and 
ors 


Ou 


1 nf = 
J — Top ® 5 (G™ (y — yopt))” ( ) G~" (y — yoo) (10.5) 
opt 


where the term (0?J/Ou7)opt is independent of y. Obviously, we want to 
select the controlled outputs such that y — yopt is small. However, this is not 
possible in practice. To see this, write 


Y— Yopt =Y—-T +L — Yopt = € + Copt (10.6) 


First, we have an optimization error €opt = r — Yopt, because the algorithm 
(e.g. a cook book) pre-computes a desired r which is different from the optimal 
Yopt- In addition, we have a control error e = y — r because the control layer 
is not perfect, for example due to poor control performance or an incorrect 
measurement or estimate (steady-state bias) of y. If the control itself is perfect 
then e = n (the measurement noise). In most cases the errors e and €op¢ can 
be assumed independent. 

From (10.5) and (10.6), we conclude that we should select the controlled 
outputs y such that: 


1. G- is small (i.e. G is large); the choice of y should be such that the inputs 
have a large effect on y. 

2. €opt = 1 — Yopt(d) is small; the choice of y should be such that its optimal 
value Yopt(d) depends only weakly on the disturbances and other changes. 

3. e=y-—r is small; the choice of y should be such that tt is easy to keep the 
control error e small. 


Note that ¢(G~') = 1/a(G), and so we want the smallest singular value 
of the steady-state gain matrix to be large (but recall that singular values 
depend on scaling as is discussed below). The desire to have o(G) large is 
consistent with our intuition that we should ensure that the controlled outputs 
are independent of each other. Also note that the desire to have a(G) large 
(and preferably as large as possible) is here not related to the issue of input 
constraints, which was discussed in Section 6.9. 


Example 10.1 Cake baking, continued. Let us return to our initial question: 
Why select the oven temperature as a controlled output? We have two alternatives, 
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a closed-loop implementation with y = T (the oven temperature) and an open-loop 
implementation with y = u = Q (the heat input). From experience, we know that 
the optimal oven temperature Topt is largely independent of disturbances and is 
almost the same for any oven. This means that we may always specify the same 
oven temperature, say T, = 190°C, as obtained from the cook book. On the other 
hand, the optimal heat input Qopt depends strongly on the heat loss, the size of the 
oven, etc, and may vary between, say 100W and 5000W. A cook book would then 
need to list a different value of Q, for each kind of oven and would in addition need 
some correction factor depending on the room temperature, how often the oven door 
is opened, etc. Therefore, we find that it is much easier to keep eopt = T — Topt [°C] 
small than to keep Qr — Qopt [W] small. 

There may also be some difference in the control error e = y—r, but this is 
mainly because most ovens are designed to be operated with temperature feedback. 
Specifically, with a thermostat we can easily keep T close to its desired value. On the 
other hand, with most ovens it is difficult to accurately implement the desired heat 
input Q, because the knob on the stove is only a crude indicator of the actual heat 
input (this difference in control error could have been eliminated by implementing a 
watt-meter). 


To use o(G) to select controlled outputs, we see from (10.5) that we should 
first scale the outputs such that the expected magnitude of y; — yi,,, is similar 
in magnitude for each output, and scale the inputs such that the effect of 
a given deviation uj — uj,,, on the cost function J is similar for each input 
(such that (PF/Ou") oe is close to a constant times a unitary matrix). We 
must also assume that the variations in y — yi, are uncorrelated, or more 
precicely, we must assume: 


(g) The “worst-case” combination of outputs deviations yj — Yin 
correpsonding to the direction of ¢(G), can occur in practice. 


Procedure for selecting controlled outputs. The use of the minimum 
singular value to select controlled outputs may be summarized in the following 
procedure: 


1. From a (nonlinear) model compute the optimal parameters (inputs and 
outputs) for various conditions (disturbances, operating points). (This 
yields a “look-up” table of optimal parameter values as a function of the 
operating conditions.) 

2. From this data obtain for each candidate output the variation in its optimal 
value, Uj = (Yioot.max — Yiopt.min)/2- 

3. Scale the candidate outputs such that for each output the sum of the 
magnitudes of v; and the control error (e.g. measurement noise) is similar 
(e.g. about 1). 

4. Scale the inputs such that a unit deviation in each input from its optimal 
value has the same effect on the cost function J. 
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5. Select as candidates those sets of controlled outputs which correspond to 
a large value of o(G). G is the transfer function for the effect. of the scaled 
inputs on the scaled outputs. 


The aero-engine application in Chapter 12 provides a nice illustration of 
output selection. There the overall goal is to operate the engine optimally in 
terms of fuel consumption, while at the same time staying safely away from 
instability. The optimization layer is a look-up table, which gives the optimal 
parameters for the engine at various operating points. Since the engine at 
steady-state has three degrees-of-freedom we need to specify three variables 
to keep the engine approximately at the optimal point, and five alternative 
sets of three outputs are given. The outputs are scaled as outlined above, and 
a good output set is then one with a large value of (G), provided we can also 
achieve good dynamic control performance. 


Remark 1 In the above procedure for selecting controlled outputs, based on 
maximizing o(G), the variation in yop+(d) with d (which should be small) enters 
into the scaling of the outputs. 


Remark 2 A more exact procedure, which may be used if the optimal outputs are 
correlated such that assumption( 10.3) does not hold, is the following: By solving the 
nonlinear equations, evaluate directly the cost function J for various disturbances d 
and control errors e, assuming y = r+e where r is kept constant at the optimal value 
for the nominal disturbance. The set of controlled outputs with smallest average or 
worst-case value of J is then preferred. This approach is usually more time consuming 
because the solution of the nonlinear equations must be repeated for each candidate 
set of controlled outputs. 


Measurement selection for indirect control 


The above ideas also apply for the case where the overall goal is to keep some 
variable z at a given value (setpoint) z-, e.g. J = ||z — z,||. We assume we 
cannot measure z, and instead we attempt to achieve our goal by controlling 


y at some fixed value r, e.g. r = Yopt(d) where d = 0 if we use deviation 
variables. For small changes we may assume linearity and write 


z=Gu+ Gad (10.7) 


for which z = 2, is obtained with u = uopt(d) where uopt(d) = Gy! (z,-—Gead). 
Furthermore, y = Gu + Gad, so the optimal output is 
Yopt (d) = GG 2, +(Ga — GGz'Gza)d (10.8) 
See 


where one criterion for selecting outputs y is that yopt(d) depends weakly on 
d. 
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Although the above procedure for selecting controlled outputs applies also 
in this simple case, it is easier to consider J = ||z — z,|| directly. Using 
u=G~1+(y— Gad) in (10.7) and introducing y =r +e, where r= GG;!z, (a 
constant) and e is the control error (e.g. due to measurement error), we find 


2— 2p =(Gea-G.G"'Gad+G.G'e (10.9) 
eS ——’” 
P; i 


To minimize ||z—z2|| we should then select controlled outputs such that || Pad] 
and ||P,-e|| are small. Note that Py depends on the scaling of disturbances d 
and “primary” outputs z (and are independent of the scaling of inputs u and 
selected outputs y, at least for square plants). The magnitude of the control 
error e depends on the choice of outputs. Based on (10.9) a procedure for 
selecting controlled outputs may be suggested. 

Procedure for selecting controlled outputs for indirect control. 
Scale the disturbances d to be of magnitude 1 (as usual), and scale the outputs 
y so the expected control error e (measurement noise) is of magnitude 1 for 
each output (this is different from the output scaling used in step 3 in our 
minimum singular value procedure). Then to minimize J we prefer sets of 
controlled outputs which: 


Minimize ||[P: P,]|l (10.10) 


Remark 1 The choice of norm in (10.10) depends on the basis for the scaling, but 
the choice is usually of secondary importance. The maximum singular value arises 
if ||d|]2 <1 and |le||2 < 1, and we want to minimize ||z — z,|l2. 


Remark 2 The above procedure does not require assumption 10.3 of uncorrelated 
variations in the optimal values of y; — yc... 


Remark 3 Of course, for the choice y = z we have that yopt = Zr is independent 
of d and the matrix P, in (10.9) and (10.10) is zero. However, P, is still nonzero. 


Remark 4 In some cases this measurement selection problem involves a trade-off 
between wanting ||Pz|| small (want strong correlation between measured outputs y 
and “primary” outputs z) and wanting ||P,|| small (want the effect of control errors 
(measurement noise) to be small). 


For example, this is the case in a distillation column when we use temperatures 
inside the column (y) for indirect control of the product compositions (z). Here, for 
a high-purity separation, we cannot place the measurement too close to the column 
end due to sensitivity to measurement error (||P,|| becomes large), and not too far 
from the column end due to sensitivity to disturbances (|| Pz||]) becomes large). 


Remark 5 Indirect control is related to the idea of inferential control which is 
commonly used in the process industry. However, usually in inferential control the 
idea is to use the measurement of y to estimate (infer) z and then to control this 
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estimate rather than controlling y directly, e.g. see Stephanopoulos (1989). However, 
there is no universal agreement on these terms, and Marlin (1995) uses the term 
inferential control to mean indirect control as discussed here. 


Remark 6 The problem of indirect control is closely related to that of cascade 
control discussed in Section 10.7.2 (where wu is uz, z is y1, y is yo, G is Goo, etc.), 
see (10.32), where we introduce Py, as the partial disturbance gain matrix. The 
main difference is that in cascade control we also measure and control z (yi) in an 
outer loop. In this case we want ||[ Pa P,]|| small at high frequencies beyond the 
bandwidth of the outer loop involving z (y1). 


Remark 7 One might say that (10.5) and the resulting procedure for output 
selection, generalizes the use of Py and P, from the case of indirect control to the 
more general case of minimizing some cost unction J. 


Summary. In this section we have considered the following problem: 
During operation, the optimal values of all variables will change with time 
(due to disturbances and other changes). For practical reasons, we consider 
a hierarchical strategy where the optimization is performed only periodically. 
The question is then: Which variables (controlled outputs) should be kept 
constant (between each optimization)? Essentially, we found that we should 
select variables y for which the variation in optimal value and control error is 
small compared to their controllable range (the range y may reach by varying 
the input u). This is hardly a big surprise, but it is nevertheless useful and 
provides the basis for our procedure for selecting controlled outputs. 

The objective of the control layer is then to keep the controlled outputs at 
their reference values (which are computed by the optimization layer). The 
controlled outputs are often measured, but we may also estimate their values 
based on other measured variables. We may also use other measurements 
to improve the control of the controlled outputs, for example, by use of 
cascade control. Thus, the selection of controlled and measured outputs are 
two separate issues, although the two decisions are obviously related. The 
measurement selection problem is briefly discussed in the next section. In 
subsection 10.5 we discuss the relative gain array of the “big” transfer matrix 
(with all candidate outputs included), which is a useful screening tool also for 
selecting controlled outputs. 


10.4 Selection of manipulations and 
measurements 

In some cases there are a large number of candidate measurements and/or 

manipulations. The need for control has three origins 


e to stabilize an unstable plant, 
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e to reject disturbances, 
e to track reference changes, 


and the selection of manipulations and measurements is intimately related to 
these. 

For measurements, the rule is to select those which have a strong 
relationship with the controlled outputs, or which may quickly detect a major 
disturbance and which together with manipulations can be used for local 
disturbance rejection. 

The selected manipulations should have a large effect on the controlled 
outputs, and should be located “close” (in terms of dynamic response) to the 
outputs and measurements. If the plant is unstable, then the manipulations 
must be selected such that the unstable modes are state controllable, and 
the measurements must be selected such that the unstable modes are state 
observable. 

For a more formal analysis we may consider the model yan = GanUan + 
Gaaud. Here 


© yan = all candidate outputs (measurements) 
@ Uan = all candidate inputs (manipulations) 


The model for a particular combination of inputs and outputs is then y = 
Gu + Gad where 


G = SoGanS7; Ga = SoG aa (10.11) 


Here So is a non-square input “selection” matrix with a 1 and otherwise 
0’s in each row, and $7 is a non-square output “selection” matrix with a 1 
and otherwise 0’s in each column. For example, with So = I all outputs are 
selected, and with So =[0 I] output 1 has not been selected. 

To evaluate the alternative combinations, one may, based on G and Gg, 
perform an input-output controllability analysis as outlined in Chapter 6 
for each combination (e.g, consider the minimum singular value, RHP-zeros, 
interactions, etc). At least this may be useful for eliminating some alternatives. 
A more involved approach, based on analyzing achievable robust performance 
by neglecting causality, is outlined by Lee, Braatz, Morari and Packard (1995). 
This approach is more involved both in terms of computation time and in 
the effort required to define the robust performance objective. An even more 
involved (and exact) approach would be to synthesize controllers for optimal 
robust performance for each candidate combination. 

However, the number of combinations has a combinatorial growth, so even 
a simple input-output controllability analysis becomes very time-consuming 
if there are many alternatives. For a plant where we want to select m from M 
candidate manipulations, and | from L candidate measurements, the number 


CONTROL STRUCTURE DESIGN 437 


of possibilities is 


(7) (m) 7 i DI aa = (10.12) 


A few examples: Form =1=1 and M = L = 2 the number of possibilities is 
4, form =l=2and M =L = 4 it is 36, form =1=5 and M = L = 10 it is 
63504, and form = M,1=5 and L = 100 (selecting 5 measurements out of 
100 possible) there are 75287520 possible combinations. 


Remark. The number of possibilities is much larger if we consider all possible 
combinations with 1 to M inputs and 1 to LE outputs. The number is (Nett, 1989): 
yaa ey (7) (™) For example, with M = L = 2 there are 4+2+2+1=9 
candidates (4 structures with one input and one output, 2 structures with two inputs 
and one output, 2 structures with one input and two outputs, and 1 structure with 
two inputs and two outputs). 


One way of avoiding this combinatorial problem is to base the selection 
directly on the “big” models Gan and G'aan. For example, one may consider the 
singular value decomposition and relative gain array of G', as discussed in the 
next section. This rather crude analysis may be used, together with physical 
insight, rules of thumb and simple controllability measures, to perform a 
pre-screening and reduce the possibilities to a manageable number. These 
candidate combinations can then be analyzed more carefully. 


10.5 RGA for non-square plant 


A simple but effective screening tool for selecting inputs and outputs, which 
avoids the combinatorial problem just mentioned, is the relative gain array 
(RGA) of the “big” transfer matrix Gan with all candidate inputs and outputs 
included, A = Gan x Gt. 

Essentially, for the case of many candidate manipulations (inputs) one may 
consider not using those manipulations corresponding to columns in the RGA 
where the sum of the elements is much smaller than 1 (Cao, 1995). Similarly, 
for the case of many candidate measured outputs (or controlled outputs) one 
may consider not using those outputs corresponding to rows in the RGA where 
the sum of the elements is much smaller than 1. 

To see this, write the singular value decomposition of Gan as 


Gan = ULV" =U,2,VF (10.13) 


where ©, consists only of the r = rank(G) nonzero singular values, U,. consists 
of the r first columns of U, and V, consists of the r first columns of V. Thus, 
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V, consists of the input directions with a nonzero effect on the outputs, and 
U, consists of the output directions we can affect (reach) by use of the inputs. 

Lete;=[0 --- 0 1 0 ::: 0]" be a unit vector with a 1 in position 
j and 0’s elsewhere. Then the j’th input is u; = e; U. Define e; in a similar 
way such that the i’th output is y; = e7 y. We then have that ej Vr yields 
the projection of a unit input u; onto the effective input space of G, and we 


follow Cao (1995) and define 
Projection for input 7 = lle; Ve ll2 (10.14) 


which is a number between 0 and 1. Similarly, e7U, yields the projection of a 
unit output y; onto the effective (reachable) output space of G, and we define 


Projection for output i = |le7 U;|l2 (10.15) 


which is a number between 0 and 1. The following theorem links the input and 
output (measurement) projection to the column and row sums of the RGA. 


Theorem 10.1 (RGA and input and output projections.) The i’th row 
sum of the RGA is equal to the square of the i’th output projection, and the 
jth column sum of the RGA is equal to the square of the jth input projection, 
1.€. 

m l 

Do Aig = MlePUrllbs ST Ag = lleP Vellb (10.16 

j=l 


t=1 


Proof: See Appendix A.4.2. 


The RGA is a useful screening tool because it need only be computed 
once. It includes all the alternative inputs and/or outputs and thus avoids the 
combinatorial problem. From (10.16) we see that the row and column sums 
of the RGA provide a useful way of interpreting the information available in 
the singular vectors. For the case of extra inputs the RGA-values depend on 
the input scaling, and for extra outputs on the output scaling. The variables 
must therefore be scaled prior to the analysis. Note that for a nonsingular 
G the RGA is scaling independent and row and column sums to 1, i.e. the 
projection of any input or output is 1. 


Example 10.2 Cao (1995) considers the selection of manipulations in a chemical 
process for the hydrodealkylation of toluene (HDA process). The plant has 5 controlled 
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outputs and 13 candidate manipulations. At steady-state 


0.6055 0.8814 -0.1079 2.3769 —0.0540 
1.4722 5.0025 -—1.3279 8.8609 0.1824 
1.5477 —0.1083 —0.0872 0.7539 —0.0551 
2.5653 6.9433 2.2032 1.5170 8.7714 
1.4459 7.6959 0.9927 8.1797 —0.2565 
G7,(0) = | 0.0000 0.0000 0.0000 0.0000 ~—0.0000 
0.1097 —0.7272 —0.1991 1.2574 0.0217 
0.3485 2.9909 —0.8223 5.2178 0.0853 
1.5899 —0.9647 0.3648 1.1514 —8.5365 
0.0000 0.0002 + —0.5397 0.0001 0.0000 
Ee —0.1351 0.0164 0.1451 0.0041 
—0.0443 —0.1859 0.0212 0.1951 0.0054 


| 0.7878 1.1489 2.6640 —3.0928 —0.0703 


and the corresponding RGA-matrixz and column-sums A¥ are 


0.1275 —0.0755 0.5907 0.1215 0.0034 0.77 

| 0.0656 —0.0523 0.0030 0.1294 0.0002 hosts 
0.2780 0.0044 0.0463 0.4055 —0.0060 0.73 
0.3684 —0.0081 0.0009 0.0383 —0.0018 0.40 
—0.0599 0.9017 0.2079 —0.1459 0.0443 0.95 
0.1683 0.4042 0.1359 0.1376 0.0089 0.85 
AT = | 0.0000 0.0000 0.0000 0.0000 0.0000 |, Ag = | 0.00 
0.0014 —0.0017 0.0013 0.0099 0.0000 0.01 
0.0129 —0.0451 0.0230 0.1873 —0.0005 0.18 
0.0374 —0.1277 —0.0359 0.1163 0.9516 0.94 
0.0000 0.0000 0.0268 0.0000 0.0000 0.03 

| 0.0001 0.0001 0.0000 0.0001 0.0000 Ree 
0.0002 0.0002 0.0001 0.0001 0.0000 0.00 


There exist C) = 1287 combinations with 5 inputs and 5 outputs, and (3) = 1716 
combinations with 6 inputs and 5 outputs. The RGA may be useful in providing an 
initial screening. 

We find from the column sums of the steady-state RGA-matria given in AE that 
the five inputs with the largest projections onto the input space of Gan are 5, 10, 
6, 1, and 3 (in that order). For this selection o(G) = 1.73 whereas (Gan) = 2.45 
with all inputs included. This shows that we have not lost much gain in the low- 
gain direction by using only 5 of the 13 inputs. Of course, for control purposes one 
must also consider higher frequencies up to crossover. The main difference at higher 
frequency is that input 7 (which has no steady-state effect) is effective, whereas input 
1 is less effective. This may lead one to a control structure with siz inputs where input 
uz is used at high frequencies and input ui at low frequencies. 

However, there are a large number of other factors that determine controllability, 
such as RHP-zeros, sensitivity to uncertainty, and these must be taken into account 
when making the final selection. 
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The following example shows that although the RGA is an efficient screening 
tool, it must be used with some caution. 


Example 10.3 Consider a plant with 2 inputs and 4 candidate outputs of which 
we want to select 2. We have: 


io 9 | ea 3.27 
10 9 1.96 —1.43 
Gee ge gfe PG goy gag 
| 2 1 | | 0.80 2540 | 
The four row sums of RGA-matriz are 0.70, 0.53, 0.38 and 0.38. To maximize the 
output projection we should select outputs 1 and 2. However, this yields a plant Gi = 


be a which is ill-conditioned with large RGA-elements, A(Gi) = Fs ah 


and is likely to be difficult to control. On the other hand, selecting outputs 1 and 


3 yields Gy = es yl which is well-conditioned with A(G2) = es : I: For 


2 —-1 
comparison, the minimum singular values are: o(Gau) = 1.05, ¢(Gi) = 0.51, and 
a(G2) =0.70. 


We discuss in Section 10.7.2 below the selection of extra measurements for 
use in a cascade control system. 


10.6 Control configuration elements 


We now assume that the measurements, manipulations and controlled 
outputs are fixed. The available synthesis theories presented in this 
book result in a multivariable controller K which connects all available 
measurements/commands (v) with all available manipulations (uw), 


u= Kv (10.17) 


However, such a controller may not be desirable. By control configuration 
selection we mean the partitioning of measurements/commands and 
manipulations within the control layer. More specifically, we define 


Control configuration. The restrictions imposed on the overall controller 
K by decomposing it into a set of local controllers (subcontrollers, 
units, elements, blocks) with predetermined links and with a possibly 
predetermined design sequence where subcontrollers are designed locally. 


In a conventional feedback system a typical restriction on K is to use a one 
degree-of-freedom controller (so that we have the same controller for r and 
—y). Obviously, this limits the achievable performance compared to that of a 
two degrees-of-freedom controller. In other cases we may use a two degrees-of- 
freedom controller, but we may impose the restriction that the feedback part 
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of the controller (K,) is first designed locally for disturbance rejection, and 

then the prefilter (K,.) is designed for command tracking. In general this will 

limit the achievable performance compared to a simultaneous design, see also 

the Remark on page 113. Similar arguments apply to other cascade schemes. 
Some elements used to build up a specific control configuration are: 


Cascade controllers 
Decentralized controllers 
Feedforward elements 
Decoupling elements 
Selectors 


These are discussed in more detail below, and in the context of the process 
industry in Shinskey (1988) and Balchen and Mumme (1988). First, some 
definitions: 


Decentralized control is when the control system consists of independent 
feedback controllers which interconnect a subset of the output 
measurements/commands with a subset of the manipulated inputs. These 
subsets should not be used by any other controller. 


This definition of decentralized control is consistent with its use by the 
control community. In decentralized control we may rearrange the ordering 
of measurements/commands and manipulated inputs such that the feedback 
part of the overall controller K in (10.17) has a fixed block-diagonal structure. 


Cascade control is when the output from one controller is the input to 
another. This is broader then the conventional definition of cascade 
control which is that the output from one controller is the reference 
command (setpoint) to another. 


Feedforward elements link measured disturbances and manipulated inputs. 


Decoupling elements link one set of manipulated inputs (“measurements”) 
with another set of manipulated inputs. They are used to improve the 
performance of decentralized control systems, and are often viewed as 
feedforward elements (although this is not correct when we view the 
control system as a whole) where the “measured disturbance” is the 
manipulated input computed by another decentralized controller. 


Selectors are used to select for control, depending on the conditions of the 
system, a subset of the manipulated inputs or a subset of the outputs. 


In addition to restrictions on the structure of kK, we may impose restrictions 
on the way, or rather in which sequence, the subcontrollers are designed. 
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For most decomposed control systems we design the controllers sequentially, 
starting with the “fast” or “inner” or “lower-layer” control loops in the control 
hierarchy. In particular, this is relevant for cascade control systems, and it is 
sometimes also used in the design of decentralized control systems. 

The choice of control configuration leads to two different ways of 
partitioning the control system: 


e Vertical decomposition. This usually results from a sequential design of 
the control system, e.g. based on cascading (series interconnecting) the 
controllers in a hierarchical manner. 

e Horizontal decomposition. This usually involves a set of independent 
decentralized controllers. 


Remark 1 Sequential design of a decentralized controller results in a control system 
which is decomposed both horizontally (since K is diagonal) as well as vertically 
(since controllers at higher layers are tuned with lower-layer controllers in place). 


Remark 2 Of course, a performance loss is inevitable if we decompose the control 
system. For example, for a hierarchical decentralized control system, if we select a 
poor configuration at the lower (base) control layer, then this may pose fundamental 
limitations on the achievable performance which cannot be overcome by advanced 
controller designs at higher layers. These limitations imposed by the lower-layer 
controllers may include RHP-zeros (see the aero-engine case study in Chapter 12) 
or strong interactions (see the distillation case study in Chapter 12 where the LV- 
configuration yields large RGA-elements at low frequencies). 


In this section, we discuss cascade controllers and selectors, and give some 
justification for using such “suboptimal” configurations rather than directly 
designing the overall controller kK. Later, in section 10.7 we discuss in more 
detail the hierarchical decomposition, including cascade control, partially 
controlled systems and sequential controller design. Finally, in section 10.8 
we consider decentralized diagonal control. 


10.6.1 Cascade control systems 


We want to illustrate how a control system which is decomposed into 
subcontrollers can be used to solve multivariable control problems. For 
simplicity, we here use single-input single-output (SISO) controllers of the 
form 

Ui = K;(s) (r; = Ys) (10.18) 


where K;(s) is a scalar. Note that whenever we close a SISO control loop we 
lose the corresponding input, u;, as a degree-of-freedom, but at the same time 
the reference, r;, becomes a new degree of freedom. 

It may look like it is not possible to handle nonsquare systems with 
SISO controllers. However, since the input to the controller in (10.18) is a 
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reference minus a measurement, we can cascade controllers to make use of 
extra measurements or extra inputs. A cascade control structure results when 
either of the following two situations arise. 


e The reference r; is an output from another controller (typically used for the 
case of an extra measurement y;), see Figure 10.3 (a). This is conventional 
cascade control. 

e The “measurement” y; is an output from another controller (typically used 
for the case of an extra manipulated input u,, i.g. in Figure 10.3 (b), ue is 
the measurement for controller Ky. This cascade scheme is here referred to 
as input resetting. 


(b) Extra inputs uz (input resetting) 


Figure 10.3: Cascade implementations 


10.6.2 Cascade control: Extra measurements 


In many cases we make use of extra measurements yo (secondary outputs) to 
provide local disturbance rejection and linearization, or to reduce the effect 
of measurement noise. For example, velocity feedback is frequently used in 
mechanical systems, and local flow cascades are used in process systems. Let 
u be the manipulated input, y: the controlled output (with an associated 
control objective r;) and ys the extra measurement. 

Centralized (parallel) implementation. A centralized implementation 
u = K(r—y), where K is a 2-input-1-output controller, may be written 


U= Kir(s)(r1 > Yi) oh Ko1(s) (ro oa Yy2) (10.19) 


where in most cases rg = 0 (since we do not have a degree-of-freedom to 
control ys). 
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Cascade implementation (conventional cascade control). To obtain 
an implementation with two SISO controllers we may cascade the controllers 
as illustrated in Figure 10.3 (a): 


re = Kyi(s)(r1 — yx), (10.20) 


Ug = Ko(s)(r2 — yo), To = th (10.21) 


Note that the output ra from the slower primary controller Ky, is not a 
manipulated plant input, but rather the reference input to the faster secondary 
(or slave) controller K2. For example, cascades based on measuring the actual 
manipulated variable (in which case y2 = um) are commonly used to reduce 
uncertainty and nonlinearity at the plant input. 

With ra = 0 in (10.19) the relationship between the centralized and cascade 
implementation is Ky, = KK, and Ko, = Ko. 

An advantage with the cascade implementation is that it more clearly 
decouples the design of the two controllers. It also shows more clearly that 
ro is not a degree-of-freedom at higher layers in the control system. Finally, 
it allows for integral action in both loops (whereas usually only Ky, should 
have integral action in (10.19)). 

On the other hand, a centralized implementation is better suited for direct 
multivariable synthesis; see the velocity feedback for the helicopter case study 
in Section 11.3. 


Figure 10.4: Common case of cascade control where the primary output yi depends 
directly on the extra measurement y2. y2 


Remark. Consider conventional cascade control in Figure 10.3 (a). In the general 
case y; and y2 are not directly related to each other, and this is sometimes referred 
to as parallel cascade control. However, it is common to encounter the situation in 


Figure 10.4 where y1 depends directly on y2. This is a special case of Figure 10.3 (a) 


with “Plant” = fea , and it is considered further in Example 10.1. 


Exercise 10.1 Conventional cascade control. With reference to the special 
(but common) case of conventional cascade control shown in Figure 10.4, Morari 
and Zafiriou (1989) conclude that the use of extra measurements is useful under the 
following circumstances: 


(a) The disturbance dz is significant and Gy is non-minimum phase. 
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(b) The plant Gy has considerable uncertainty associated with it — e.g. a poorly 
known nonlinear behaviour — and the inner loop serves to remove the 
uncertainty. 


In terms of design they recommended that K2 is first designed to minimize the effect 
of dz on y: (with K, = 0) and then Ky is designed to minimize the effect of d on 
yi. We want to derive conclusions (a) and (b) from an input-output controllability 
analysis, and also, (c) explain why we may choose to use cascade control if we want 
to use simple controllers (even with dz = 0). 

Outline of solution: (a) Note that if G1 is minimum phase, then the input-output 
controllability of G2 and GiG2 are in theory the same, and for rejecting d2 there 
is no fundamental advantage in measuring y1 rather than yo. (b) The inner loop 
L2 =G2Ke removes the uncertainty if it is sufficiently fast (high gain feedback) and 
yields a transfer function (I+ L2)~'Ly close to I at frequencies where K, is active. 
(c) In most cases, such as when PID controllers are used, the practical bandwidth is 
limited by the frequency wy where the phase of the plant is —180° (see section 5.12), 
so an inner cascade loop may yield faster control (for rejecting dy and tracking r1) 
if the phase of G2 is less than that of GiGo. 


Exercise 10.2 To illustrate the benefit of using inner cascades for high-order 
plants, case (c) in the above example, consider Figure 10.4 and let 


1 1 


C= (s+ 1)?’ Ga s+1 


We use a fast proportional controller Kz = 25 in the inner loop, whereas a somewhat 
slower PID-controller is used in the outer loop, 


(s +1)? 


c Ke= 
s(0.1s +1)’ : 


Ki(s)=K. 


Sketch the closed-loop response. What is the bandwidth for each of the two loops? 

Compare this with the case where we only measure yi, 80 G = G1G2, and use a 
PID-controller K(s) with the same dynamics as K1(s) but with a smaller value of 
K.. What is the achievable bandwidth? Find a reasonable value for K. (starting with 
K. =1) and sketch the closed-loop response (you will see that it is about a factor 5 
slower without the inner cascade). 


10.6.3. Cascade control: Extra inputs 


In some cases we have more manipulated inputs than controlled outputs. 
These may be used to improve control performance. Consider a plant with a 
single controlled output y and two manipulated inputs wu; and ug. Sometimes 
ug is an extra input which can be used to improve the fast (transient) control 
of y, but if it does not have sufficient power or is too costly to use for long- 
term control, then after a while it is reset to some desired value (“ideal resting 
value” ). 
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Centralized (parallel) implementation. A centralized implementation 
u = K(r—y) where K is a 1-input 2-output controller, may be written 


uy = Kyi(s)(r—y), ug = Koi(s)(r — y) (10.22) 


Here two inputs are used to control one output, so to get a unique steady-state 
for the inputs uy and ug, Ky, has integral control but Ky; does not. Then 
u(t) will only be used for transient (fast) control and will return to zero (or 
more precisely to its desired value r,,,) as t 4 co. 

Cascade implementation (input resetting). To obtain an implemen- 
tation with two SISO controllers we may cascade the controllers as shown in 
Figure 10.3 (b). We again let input us take care of the fast control and wu, of 
the long-term control. The fast control loop is then 


uy = Ko(s)(r — y) (10.23) 


The objective of the other slower controller is then to use input wu, to reset 
input ug to its desired value r,,: 


U1 = KA (8)(Tuan —Y1), Yr = U2 (10.24) 


and we see that the output from the fast controller K2 is the “measurement” 
for the slow controller Ky. 

With rz, = 0 the relationship between the centralized and cascade 
implementation is Ky, = —K,K2 and Ko, = Ko. 

The cascade implementation again has the advantage of decoupling the 
design of the two controllers. It also shows more clearly that r,,, the reference 
for u2, may be used as a degree-of-freedom at higher layers in the control 
system. Finally, we can have integral action in both Ky and Ko, but note 
that the gain of Ky should be negative (if effects of ui and uz on y are both 
positive). 


Remark 1 Typically, the controllers in a cascade system are tuned one at a time 
starting with the fast loop. For example, for the control system in Figure 10.5 we 
would probably tune the three controllers in the order K2 (inner cascade using fast 
input), Ks (input resetting using slower input), and K, (final adjustment of y1). 


Remark 2 In process control, the cascade implementation of input resetting is 
sometimes referred to as valve position control, because the extra input w2, usually 
a valve, is reset to a desired position by the outer cascade. 


Exercise 10.3 Draw the block diagrams for the two centralized (parallel) 
implementations corresponding to Figure 10.3. 


Exercise 10.4 Derive the closed-loop transfer functions for the effect of r on y, ur 
and uz for the cascade input resetting scheme in Figure 10.8 (b). As an erample use 
G=[Giu Gie]=[1 1] and use integral action in both controllers, K, = —1/s 
and Ky = 10/s. Show that input ug is reset at steady-state. 
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Example 10.4 Two layers of cascade control. Consider the system in 
Figure 10.5 with two manipulated inputs (u2 and uz), one controlled output (y1 
which should be close to r1) and two measured variables (y1 and y2). Input u2 has a 
more direct effect on y: than does input ug (there is a large delay in G3(s)). Input 
u2 should only be used for transient control as it is desirable that it remains close to 
T3 = Tu. The extra measurement yo is closer than y; to the input u2 and may be 
useful for detecting disturbances (not shown) affecting G1. 


Figure 10.5: Control configuration with two layers of cascade control. 


In Figure 10.5 controllers K, and Ky are cascaded in a conventional manner, 
whereas controllers Ky and Kz are cascaded to achieve input resetting. The 
corresponding equations are 


mn = Ki(s)(r1 > y1) (10.25) 
uz = Ko(s)(r2 — ye), r2 = tH (10.26) 
uz = K3(s)(r3 — ys), 3 = U2 (10.27) 


Controller K, controls the primary output yi at its reference r; by adjusting the 
“input” U1, which is the reference value for y2. Controller K2 controls the secondary 
output yo using input u2. Finally, controller K3 manipulates ug slowly in order to 
reset input ua to its desired value rz. 


Exercise 10.5 Process control application. A practical case of a control 
system like the one in Figure 10.5 is in the use of a pre-heater to keep the reactor 
temperature yi at a given value r;. In this case y2 may be the outlet temperature 
from the pre-heater, u2 the bypass flow (which should be reset to r3, say 10% of the 
total flow), and uz the flow of heating medium (steam). Make a process flowsheet 
with instrumentation lines (not a block diagram) for this heater/reactor process. 


Example 10.5 Decentralized control. Consider the a plant consisting of G1, 
G2 and G3 as in Figure 10.5. If we had a different control objective, namely to keep 
both yi and yz at desired values r; and rz (independent of each other), then this could 
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be implemented using two independent decentralized controllers, uz = Ki(ri — y1), 
and u2 = Ko(re — y2) (not shown in Figure). 


10.6.4 Extra inputs and outputs (local feedback) 


In many cases performance may be improved with local feedback loops 
involving extra manipulated inputs and extra measurements. However, the 
improvement must be traded off against the cost for the extra actuator, 
measurement and control system. An example where local feedback is required 
to counteract the effect of high-order lags is given for a neutralization process 
in Figure 5.24 on page 222. The use of local feedback is also discussed by 
Horowitz (1991). 


10.6.5 Selectors 


Split-range control for extra inputs. We assumed above that the extra 
input is used to improve dynamic performance. Another situation is when 
input constraints make it necessary to add a manipulated input. In this case 
the control range is often split such that, for example, u, is used for control 
when y € [Ymin, yi], and uz is used when y € [41, Ymax]- 

Selectors for too few inputs. A completely different situation occurs 
if there are too few inputs. Consider the case with one input (uw) and 
several outputs (y,y2,...). In this case, we cannot control all the outputs 
independently, so we either need to control all the outputs in some average 
manner, or we need to make a choice about which outputs are the most 
important to control. Selectors or logic switches are often used for the latter. 
Auctioneering selectors are used to decide to control one of several similar 
outputs. For example, this may be used to adjust the heat input (w) to keep the 
maximum temperature (max; y;) in a fired heater below some value. Override 
selectors are used when several controllers compute the input value, and we 
select the smallest (or largest) as the input. For example, this is used in a 
heater where the heat input (uw) normally controls temperature (y1), except 
when the pressure (yo) is too large and pressure control takes over. 


10.6.6 Why use cascade and decentralized control? 


As is evident from Figure 10.5(a), decomposed control configurations can 
easily become quite complex and difficult to maintain and understand. It 
may therefore be both simpler and better in terms of control performance to 
set up the controller design problem as an optimization problem and let the 
computer do the job, resulting in a centralized multivariable controller as used 
in other chapters of this book. 
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Tf this is the case, why is cascade and decentralized control used in practice? 
There are a number of reasons, but the most important one is probably the 
cost associated with obtaining good plant models, which are a prerequisite 
for applying multivariable control. On the other hand, with cascade and 
decentralized control each controller is usually tuned one at a time with a 
minimum of modelling effort, sometimes even on-line by selecting only a few 
parameters (e.g, the gain and integral time constant of a Pl-controller). The 
main reason for applying cascade and decentralized control is thus to save on 
modelling effort. Since cascade and decentralized control systems depend more 
strongly on feedback rather than models as their source of information, it is 
usually more important (relative to centralized multivariable control) that the 
fast control loops be tuned to respond quickly. 

Other advantages of cascade and decentralized control include the following: 
they are often easier to understand by operators, they reduce the need 
for control links and allow for decentralized implementation, their tuning 
parameters have a direct and “localized” effect, and they tend to be insensitive 
to uncertainty, for example, in the input channels (see Section 8.6.1). The issue 
of simplified implementation and reduced computation load is also important 
in many applications, but is becoming less relevant as the cost of computing 
power is reduced. 

Based on the above discussion, the main challenge is to find a control 
configuration which allows the (sub)controllers to be tuned independently 
based on a minimum of model information (the pairing problem). For 
industrial problems, the number of possible pairings is usually very high, but 
in most cases physical insight and simple tools, such as the RGA, can be 
helpful in reducing the number of alternatives to a manageable number. To 
be able to tune the controllers independently, we must require that the loops 
interact only to a limited extent. For example, one desirable property is that 
the steady-state gain from u; to y; in an “inner” loop (which has already been 
tuned), does not change too much as outer loops are closed. For decentralized 
diagonal control the RGA is a useful tool for addressing this pairing problem. 

Why do we need a theory for cascade and decentralized control? We just 
argued that the main advantage of decentralized control was its saving on 
the modelling effort, but any theoretical treatment of decentralized control 
requires a plant model. This seems to be a contradiction. However, even 
though we may not want to use a model to tune the controllers, we may 
still want to use a model to decide on a control structure and to decide 
on whether acceptable control with a decentralized configuration is possible. 
The modelling effort in this case is less, because the model may be of a 
more “generic” nature and does not need to be modified for each particular 
application. 

We present below some theoretical results for the design of decomposed 
control systems. In Section 10.7 we consider the hierarchical (vertical) 
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decomposition, and in Section 10.8 we consider the horizontal decomposition 
into independent decentralized controllers. 


10.7 Hierarchical and partial control 


A hierarchical control system results when we design the subcontrollers in 
a sequential manner, usually starting with the fast loops ( “bottom-up” 
). This means that the controller at some higher layer in the hierarchy is 
designed based on a partially controlled plant. In this section we derive 
transfer functions for partial control, and provide some guidelines for designing 
hierarchical control systems. 


10.7.1 Partial control 


Partial control involves controlling only a subset of the outputs for which there 
is a control objective. We divide the outputs into two classes: 


e y; — (temporarily) uncontrolled output (for which there is an associated 
control objective) 
@ y2 — (locally) measured and controlled output 


We also subdivide the available manipulated inputs in a similar manner: 


e wu. — inputs used for controlling y2 
e uz — remaining inputs (which may be used for controlling y1) 


We have inserted the word temporarily above, since y; is normally a 
controlled output at some higher layer in the hierarchy. However, we here 
consider the partially controlled system as it appears after having implemented 
only a local control system where uz is used to control yz. In most of the 
development that follows we assume that the outputs y are tightly controlled. 

Four applications of partial control are: 


1. Sequential design of decentralized controllers. The outputs y (which include 
yi and yg) all have an associated control objective, and we use a hierarchical 
control system. We first design a controller Kz to control the subset yo. 
With this controller Ky, in place (a partially controlled system), we may 
then design a controller kK, for the remaining outputs. 

2. Sequential design of conventional cascade control. The outputs yo are 
additional measured (“secondary”) variables which are not important 
variables in themselves. The reason for controlling y2 is to improve 
the control of y,. The references rj are used as degrees-of-freedom for 
controlling y1 so the set u is often empty. 
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3. “True” partial control. The outputs y (which include y; and yo) all have an 
associated control objective, and we consider whether by controlling only 
the subset ya; we can indirectly achieve acceptable control of y;, that is, 
the outputs y; remain uncontrolled and the set ui remains unused. 

4. Indirect control. The outputs y, have an associated control objective, 
but they are not measured. Instead, we aim at indirectly controlling my 
by controlling the “secondary” measured variables y2 (which have no 
associated control objective). The references ro are used as degrees of 
freedom and the set wu; is empty. This is similar to cascade control, but 
there is no “outer” loop involving y,. Indirect control was discussed in 
Section 10.3. 


The following table shows more clearly the difference between the four 
applications of partial control (In all cases there is a control objective 
associated with y,; and a measurement of yo): 


Measurement Control objective 


of y? for yo? 
Sequential decentralized control Yes Yes 
Sequential cascade control Yes No 
“True” partial control No Yes 
Indirect control No No 


The four problems are closely related, and in all cases we want the effect 
of the disturbances on y; to be small, when y2 is controlled. Let us derive 
the transfer functions for y; when y2 is controlled. One difficulty is that this 
requires a separate analysis for each choice of yz and uz, and the number of 
alternatives has a combinatorial growth as illustrated by (10.12). 

By partitioning the inputs and outputs, the overall model y = Gu may be 
written 


yw = Guy + Giote + Gad (10.28) 
yo = Gait + Gogue + Gaod (10.29) 


Assume now that feedback control ug = K2(r2—ye2) is used for the “secondary” 
subsystem involving ug and yo. By eliminating ug and yo, we then get the 
following model for the resulting partially controlled system: 


m= (Gu — Gio Ko(I + Go2K2)"'Gai) ur + 
(Ga — Gyo Ko(I + Gx2K2)~'Ga2) d+ 
G2 Ko (I + Go9 Ka) ro (10.30) 


Remark. This may be rewritten in terms of linear fractional transformations. For 
example, the transfer function from u1 to y1 is 


Fi(G, —K2) = Gu — Gi2K2(I + Gao K2) "Gar (10.31) 
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Perfect control of y2. In some cases we can assume that the control of yo is 
very fast compared to the control of y,, so that we effectively have that yo is 
perfectly controlled when considering the control of y,. To obtain the model 
we may formally let Ky — co in (10.30), but it is better to set yg = ro and 
solve for uz in (10.29) to get 


Ug = -Ga' God ct Goo Goth + Goo Te 


We have here assumed that G2 is square and invertible, otherwise we can 
get the least-square solution by replacing G5,| by the pseudo-inverse, Gb. On 
substituting this into (10.28) we get 


w= (Gi a Gi2Go5 Gor) Uy + (Ga oa Gi2Gon Gaz) d + GieGoo T2 (10.32) 
ee ————_ eS —SS 


A A A 
=P =P, =P, 


where Pz is called the partial disturbance gain, which is the disturbance gain 
for a system under perfect partial control, and P,, is the effect of ui on yi 
with yo perfectly controlled. In many cases the set uy is empty (there are no 
extra inputs). The advantage of the model (10.32) over (10.30) is that the it 
is independent of Ky, but we stress that it only applies at frequencies where 
yo is tightly controlled. 


Remark. Relationships similar to those given in (10.32) have been derived by many 
authors, e.g. see the work of Manousiouthakis, Savage and Arkun (1986) on block 
relative gains and the work of Haggblom and Waller (1988) on distillation control 
configurations. 


10.7.2 Hierarchical control and sequential design 


A hierarchical control systems arises when we apply a sequential design 
procedure to a cascade or decentralized control system. 

The idea is to first implement a local lower-layer (or inner) control system 
for controlling the outputs y2. Next, with this lower-layer control system in 
place, we design a controller Ky to control y;. The appropriate model for 
designing Ky is given by (10.30) (for the general case) or (10.32) (for the case 
when we can assume yo perfectly controlled). 

The objectives for this hierarchical decomposition may vary: 


1. To allow for simple or even on-line tuning of the lower-layer control system 
(Ka). 

2. To allow the use of longer sampling intervals for the higher layers (K1). 

3. To allow simple models when designing the higher-layer control system 
(K,). The high-frequency dynamics of the models of the partially controlled 
plant (e.g. P,, and P,.) may be simplified if K, is mainly effective at lower 
frequencies. 
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4. To “stabilize”! the plant using a lower-layer control system (K2) such that 
it is amenable to manual control. 


The latter is the case in many process control applications where we first 
close a number of faster “regulatory” loops in order to “stabilize” the plant. 
The higher layer control system (A) is then used mainly for optimization 
purposes, and is not required to operate the plant. 

Based on these objectives, Hovd and Skogestad (1993) proposed some 
criteria for selecting w2 and yo to be included in the lower-layer control 
system: 


1. The lower layer must quickly implement the setpoints computed by the 
higher layers, that is, the input-output controllability of the subsystem 
involving use of ug to control y2 should be good (consider Go2 and Gaz). 

2. The control of yo using uy should provide local disturbance rejection, that 
is, it should minimize the effect of disturbances on y, (consider P, for yo 
tightly controlled). 

3. The control of yo using u2 should not impose unnecessary control 
limitations on the remaining control problem which involves using u1 
and/or re to control y;. By “unnecessary” we mean limitations (RHP- 
zeros, ill-conditioning, etc.) that did not exist in the original overall 
problem involving u and y. Consider the controllability of P, for ye tightly 
controlled, which should not be much worse than that of G. 


These three criteria are important for selecting control configurations for 
distillation columns as is discussed in the next example. 


Example 10.6 Control configurations for distillation columns. The overall 
control problem for the distillation column in Figure 10.6 has 5 inputs 


u=[L V D BvVr]" 


(these are all flows: reflux L, boilup V, distillate D, bottom flow B, overhead vapour 
Vr) and 5 outputs 
y=[yo te Mp Mp pl’ 


(these are compositions and inventories: top composition yp, bottom composition 
tp, condenser holdup Mp, reboiler holdup Mz, pressure p) see Figure 10.6. This 
problem usually has no inherent control limitations caused by RHP-zeros, but the 
plant has poles in or close to the origin and needs to be stabilized. In addition, 
for high-purity separations the 5 x 5 RGA-matriz may have some large elements at 
low frequencies. Another complication is that composition measurements are often 
ezpensive and unreliable. 


T The terms “stabilize” and “unstable” as used by process operators may not refer to a 
plant that is unstable in a mathematical sense, but rather to a plant that is sensitive to 
disturbances and which is difficult to control manually. 
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In most cases, the distillation column is first stabilized by closing three 
decentralized SISO loops for level and pressure so 


yo=[Mp Mp pl" 


and the remaining outputs are 


yi=[yo ee)" 

The three SISO loops for controlling y2 usually interact weakly and may be tuned 
independently of each other. However, since each level (tank) has an inlet and 
two outlet flows, there exists many possible choices for u2 (and thus for u1). 
By convention, each choice (“configuration”) is named by the inputs uy, left for 
composition control. 


Figure 10.6: Typical distillation column with LV-control configuration 


For ezample, the “LV -configuration” used in many examples in this book refers 
to a partially controlled system where we use 


u=[L Vv] 


to control y: (and we assume that there is a control system in place which uses 
us=[D B Vr]" to control yo). The LV -configuration is good from the point of 
view that control of y1 using ui is nearly independent of the tuning of the controller 
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Ke involving yo and uz. However, the problem of controlling yi by ui (“plant” P,) 
is often strongly interactive with large steady-state RGA-elements in Py. 
Another configuration is the DV -configuration where 


uw =[D vy" 


and thus ug = [L B Vr)’. In this case, the steady-state interactions from u1 
to yi are generally much less, and P, has small RGA-elements. But the model in 
(10.80) depends strongly on Ky (i.e. on the tuning of the level loops), and a slow 
level loop for Mp may introduce unfavourable dynamics for the response from u, to 
yi. 

There are also many other possible configurations (choices for the two inputs in 
ui); with five inputs there are 10 alternative configurations. Furthermore, one often 
allows for the possibility of using ratios between flows, e.g. L/D, as possible degrees- 
of-freedom in ui and this sharply increases the number of alternatives. 

Expressions which directly relate the models for various configurations, e.g. 
relationships between PEY,P?Y and PP’,PP” etc., are given in Haggblom and 
Waller (1988) and Skogestad and Morari (1987a). However, it may be simpler to 
start from the overall 5 x 5 model G, and derive the models for the configurations 
using (10.80) or (10.82), see also the MATLAB file on page 524. 

To select a good distillation control configuration, one should first consider the 
problem of controlling levels and pressure (y2). This eliminates a few alternatives, 
so the final choice is based on the 2 x 2 composition control problem (yi). If ya is 
tightly controlled then none of the configurations seem to yield RHP-zeros in P,. 
Important issues to consider then are disturbance sensitivity (the partial disturbance 
gain Py should be small) and the interactions (the RGA-elements of P,). These 
issues are discussed by, for erample, Waller, Haggblom, Sandelin and Finnerman 
(1988) and Skogestad, Lundstrém and Jacobsen (1990). Another important issue is 
that it is often not desirable to have tight level loops and some configurations, like 
the DV -configuration mentioned above, are sensitive to the tuning of K2. Then the 
expressions for Py and Py, which are used in the references mentioned above, may 
not apply. This is further discussed in Skogestad (1992). 

Because of the problems of interactions and the high cost of composition 
measurements, we often find in practice that only one of the two product 
compositions is controlled (“true” partial control). This is discussed in detail 
in Example 10.8 below. Another common solution is to make use of additional 
temperature measurements from the column, where their reference values are set 
by a composition controller in a cascade manner. 


In summary, the overall 5 x 5 distillation control problem is usually solved 
by first designing a 3 x 3 controller Ky for levels and pressure, and then 
designing a 2 x 2 controller A, for the the composition control. This is then 
a case of (block) decentralized control where the controller blocks Ky and Ko 
are designed sequentially (in addition, the blocks K, and Ky may themselves 
be decentralized). 

Sequential design is also used for design of cascade control systems. This is 
discussed next. 
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Sequential design of cascade control systems 


Consider the conventional cascade control system in Figure 10.3 (a), where 
we have additional “secondary” measurements yo with no associated control 
objective, and the objective is to improve the control of the primary outputs 
yi by locally controlling yo. The idea is that this should reduce the effect of 
disturbances and uncertainty on y1. 

From (10.32), it follows that we should select secondary measurements y2 
(and inputs w2) such that ||Pa|| is small and at least smaller than ||Gaz||. 
In particular, these arguments apply at higher frequencies. Furthermore, it 
should be easy to control y; by using as degrees-of-freedom the references rz 
(for the secondary outputs) or the unused inputs wu. More precisely, we want 
the input-output controllability of the “plant” [P,, P,] (or P, if the set uz 
is empty) with disturbance model Pj, to be better than that of the plant 
[Gi1 Gig] (or Giz) with disturbance model Gat. 


Remark. Most of the arguments given in Section 10.2, for the separation into an 
optimization and a control layer, and in Section 10.3, for the selection of controlled 
outputs, apply to cascade control if the term “optimization layer” is replaced by 
“primary controller”, and “control layer” is replaced by “secondary controller”. 


Exercise 10.6 The block diagram in Figure 10.4 shows a cascade control system 
where the primary output yi depends directly on the extra measurement y2, $0 
Gi2 = GiG2, Go2 =G2, Gar =[I Gi] andGa2 =[0 I]. Show that Pi =[I 0] 
and P, = G1 and discuss the result. Note that P, is the “new” plant as it appears 
with the inner loop closed. 


10.7.3. “True” partial control 


We here consider the case where we attempt to leave a set of primary outputs 
yt uncontrolled. This “true” partial control may be possible in cases where 
the outputs are correlated such that controlling the outputs yo indirectly 
gives acceptable control of y,. One justification for partial control is that 
measurements, actuators and control links cost money, and we therefore prefer 
control schemes with as few control loops as possible. 

To analyze the feasibility of partial control, consider the effect of 
disturbances on the uncontrolled output(s) y, as given by (10.32). Suppose 
all variables have been scaled as discussed in Section 1.4. Then we have that: 


e A set of outputs y, may be left uncontrolled only if the effects of all 
disturbances on y1, as expressed by the elements in the corresponding partial 
disturbance gain matriz Py, are less than 1 in magnitude at all frequencies. 


Therefore, to evaluate the feasibility of partial control one must for each choice 
of controlled outputs (ya) and corresponding inputs (uz), rearrange the system 
as in (10.28) and (10.29) and compute Py using (10.32). 
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There may also be changes in rz (of magnitude Rz) which may be regarded 
as disturbances on the uncontrolled outputs y,. From (10.32) then, we also 
have that: 


e A set of outputs y, may be left uncontrolled only if the effects of all reference 
changes in the controlled outputs (y2) on yi, as expressed by the elements 
in the matrix Gi2G5o Ro, are less than 1 in magnitude at all frequencies. 


One uncontrolled output and one unused input. “True” partial 
control is often considered if we have an m x m plant G(s) where acceptable 
control of all m outputs is difficult, and we consider leaving input u; unused 


and output y; uncontrolled. In this case, instead of rearranging y into A and 


u into ie for each candidate control configuration, we may directly evaluate 


2 
the partial disturbance gain based on the overall model y = Gu + Gad. The 
effect of a disturbance d; on the uncontrolled output y; is 


Oys ) IG*Galir 
P, = Se) 10.33 
< & w-gico Ip ) 


where “u; = 0,yi4; = 0” means that input u; is constant and the remaining 
outputs yj4; are constant. 


Proof of (10.33): The proof is from (Skogestad and Wolff, 1992). Rewrite y = 
Gu + [Ga]edzy as u = Guty — [G7 "]hGad. Set y = 0 for all # 7%. Then uj; = 
G"]jeyi —[G7'Ga]jndy and by setting uj = 0 we find y;/dx =[G7'Ga]jx/[Go ji. 


From (10.33) we derive direct insight into how to select the uncontrolled 
output and unused input: 


1. Select the unused input uj such that the j’th row in G7'Ga has small 
elements. That is, keep the input constant (unused) if its desired change is 
small. 

2. Select the uncontrolled output y; and unused input wu; such that the 
jv’th element in G-! is large. That is, keep an output uncontrolled if 
it is insensitive to changes in the unused input with the other outputs 
controlled. 


Example 10.7 Consider the FCC process in Exercise 6.17 with 


G(0)=|-16.7 31.0 —1.41 0.03 0.03 —0.02 


16.8 30.5 4.30 i 0.09 0.02 —0.06 
|: a7(0) = 
1.27 54.1 5.40 —0.34 —0.32 0.38 


where we want leave one input unused and one output uncontrolled. From the second 
rule, since all elements in the third row of G~! are large, it seems reasonable to let 
input ug be unused, as is done in Exercise 6.17. (The outputs are mainly selected to 
avoid the presence of RHP-zeros, see Exercise 6.17.) 
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We next consider a 2 x 2 distillation process where it is difficult to control 
both outputs independently due to strong interactions, and we leave one 
output (y,) uncontrolled. To improve the performance of y; we also consider 
the use of feedforward control where wu; is adjusted based on measuring the 
disturbance (but we need no measurement of y1). 


Example 10.8 Partial and feedforward control of 2 x 2 distillation 
process. Consider a distillation process with 2 inputs (reflux L and boilup V ), 
2 outputs (product compositions yn and xp) and 2 disturbances (feed flowrate F 
and feed composition zr). We assume that changes in the reference (r1 and r2) are 
infrequent and they will not be considered. At steady-state (s =0) we have 


_ [87.8 -86.4 _ | 7.88 8.81 1, _ [0.54 —0.005 
Ga tes ee coe male Gam | a oe 


(10.34) 
Since the rows elements in G~'Gg are similar in magnitude as are also the elements 
of G~' (between 0.3 and 0.4), the rules following (10.33) do not clearly favour any 
particular partial control scheme. This is confirmed by the values of Pi, which are 
seen to be quite similar for the four candidate partial control schemes: 


1.36 |" 1.63]" 1.72] 2.00]° 
yy dt. pd. ae ODS os . i . 
fa= eerie ofa = ee phe = Ral plas = lass 


The superscripts here denote the controlled output and corresponding input. 
Importantly, in all four cases, the magnitudes of the elements in Py are much smaller 
than in Ga, so control of one output significantly reduces the effect of the disturbances 
on the uncontrolled output. In particular, this is the case for disturbance 2, for which 
the gain is reduced from about 10 to 0.33 and less. 

Let us consider in more detail scheme 1 which has the smallest disturbance 
sensitivity for the uncontrolled output (Pz). This scheme corresponds to controlling 
output yo (the bottom composition) using u2 (the boilup V) and with y, (the top 
composition) uncontrolled. We use a dynamic model which includes liquid flow 
dynamics; the model is given in Section 12.4. Frequency-dependent plots of Ga and 
Py show that the conclusion at steady state also applies at higher frequencies. This 
is illustrated in Figure 10.7, where we show for the uncontrolled output y; and the 
worst disturbance d, both the open-loop disturbance gain (Gai1, Curve 1) and the 
partial disturbance gain (Pi, Curve 2). For disturbance dz the partial disturbance 
gain (not shown) remains below 1 at all frequencies. 

The partial disturbance gain for disturbance d, (the feed flowrate F') is somewhat 
above 1 at low frequencies (Pa(0) = —1.36), so let us next consider how we may 
reduce its effect on y1. One approach is to reduce the disturbance itself, for example, 
by installing a buffer tank (as in pH-example in Chapter 5.16.3). However, a buffer 
tank has no effect at steady-state, so it does not help in this case. 

Another approach is to install a feedforward controller based on measuring di 
and adjusting u, (the refluz L) which is so far unused. In practice, this is easily 
implemented as a ratio controller which keeps L/F constant. This eliminates the 
steady-state effect of di on y1 (provided the other control loop is closed). In terms 
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Figure 10.7: Effect of disturbance 1 on output 1 for distillation column example. 


of our linear model, the mathematical equivalence of this ratio controller is to 
use ui = —0.54d1, where —0.54 is the 1,1-element in —G~'Ga. The effect of 
the disturbance after including this static feedforward controller is shown as curve 
3 in Figure 10.7. However, due to measurement error we cannot achieve perfect 
feedforward control, so let us assume the error is 20%, and use ui: = —1.2-0.54d1. 
The steady-state effect of the disturbance is then P,(0)(1 — 1.2) = 1.36 - 0.2 = 0.27, 
which is still acceptable. But, as seen from the frequency-dependent plot (curve 4), 
the effect is above 0.5 at higher frequencies, which may not be desirable. The reason 
for this undesirable peak is that the feedforward controller, which is purely static, 
reacts too fast, and in fact makes the response worse at higher frequencies (as seen 
when comparing curves 3 and 4 with curve 2). To avoid this we filter the feedforward 
action with a time constant of 3 min resulting in the following feedforward controller: 


0.54 

= —~———d 10.35 

oe 35+1 ; ( ) 
To be realistic we again assume an error of 20%. The resulting effect of the 
disturbance on the uncontrolled output is shown by curve 5, and we see that the 


effect is now less than 0.27 at all frequencies, so the performance is acceptable. 


Remark. In the example there are four alternative partial control schemes with 
quite similar disturbance sensitivity for the uncontrolled output. To decide on the best 
scheme, we should also perform a controllability analysis of the feedback properties 
of the four 1x1 problems. Performing such an analysis, we find that schemes 1 (the 
one chosen) and 4 are preferable, because the input in these two cases has a more 
direct effect on the output, and with less phase lag. 


In conclusion, for this example it is difficult to control both outputs 
simultaneously using feedback control due to strong interactions. However, 
we can almost achieve acceptable control of both outputs by leaving y, 
uncontrolled. The effect of the most difficult disturbance on y; can be further 
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reduced using a simple feedforward controller (10.35) from disturbance d, to 
U4. 


10.8 Decentralized feedback control 


Figure 10.8: Decentralized diagonal control of a 2 x 2 plant 


In this section, G(s) is a square plant which is to be controlled using a diagonal 
controller (see Figure 10.8) 


K(s) = diag{ki(s)} = we (10.36) 


km/(8) 


This is the problem of decentralized diagonal feedback control. The design of 
decentralized control systems involves two steps: 


1. The choice of pairings (control configuration selection) 
2. The design (tuning) of each controller, k;(s). 


The optimal solution to this problem is very difficult mathematically, because 
the optimal controller is in general of infinite order and may be non-unique; 
we do not address it in this book. The reader is referred to the literature 
(e.g. Sourlas and Manousiouthakis, 1995) for more details. Rather we aim 
at providing simple tools for pairing selections (step 1) and for analyzing the 
achievable performance (controllability) of diagonally controlled plants (which 
may assist in step 2). 

Remark. The treatment in this section may be generalized to block-diagonal 


controllers by, for example, introducing tools such as the block relative gain array 
of Manousiouthakis et al. (1986). 
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Notation for decentralized diagonal control. G(s) denotes a square 
m xm plant with elements g;;. G’!(s) denotes the remaining (m—1) x (m—1) 
plant obtained by removing row i and column j in G(s). With a particular 
choice of pairing we can rearrange the columns or rows of G(s) such that 
the paired elements are along the diagonal of G(s). We then have that the 
controller K(s) is diagonal (diag{k;}), and we also introduce 


91 


A 922 


G = diag {gis} = (10.37) 


Imm 
as the matrix consisting of the diagonal elements of G. The loop transfer 
function in loop i is denoted L; = gi:k;, which is also equal to the 7’th diagonal 
element of L=GK. 


The magnitude of the off-diagonal elements in G (the interactions) relative 
to its diagonal elements are given by the matrix 


BL (G-@@? (10.38) 


A very important relationship for decentralized control is given by the 
following factorization of the return difference operator: 


(I+GK)=(I+ET) (I1+GK) (10.39) 
—S —S e—_H_— 
overall interactions individual loops 


or equivalently in terms of the sensitivity function S$ = (I+ GK)", 


S=S(I+ET)} (10.40) 
Here 
aA ws ; 1 Base 
S=(1+GK) = diag{ aie! and T=I-S (10.41) 


contain the sensitivity and complementary sensitivity functions for the 
individual loops. Note that S is not equal to the matrix of diagonal elements 
of S. (10.40) follows from (A.138) with G = G and G’ = G. The reader is 
encouraged to confirm that (10.40) is correct, because most of the important 
results for stability and performance using decentralized control may be 
derived from this expression. An alternative factorization which follows from 
(A.139) is 

S$=(14+ $0 —D)-'ST (10.42) 


where [' is the Performance Relative Gain Array (PRGA), 


I> 


T(s) = G(s)G-1(s) (10.43) 
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which is a scaled inverse of the plant. Note that FE =I~! — J. At frequencies 
where feedback is effective (9 ~ 0), (10.42) yields S ~ ST which shows that 
T is important when evaluating performance with decentralized control. The 
diagonal elements of the PRGA-matrix are equal to the diagonal elements 
of the RGA, yi = Au, and this is the reason for its name. Note that the 
off-diagonal elements of the PRGA depend on the relative scaling on the 
outputs, whereas the RGA is scaling independent. On the other hand, the 
PRGA measures also one-way interaction, whereas the RGA only measures 
two-way interaction. 

We also will make use of the related Closed-Loop Disturbance Gain (CLDG) 
matrix, defined as 


Gals) = T(s)Ga(s) = G(s)\G-"(s)Ga(s) (10.44) 


The CLDG depends on both output and disturbance scaling. 


10.8.1 RGA as interaction measure for decentralized 
control 


We here follow Bristol (1966), and show that the RGA provides a measure 
of the interactions caused by decentralized diagonal control. Let uj and y; 
denote a particular input and output for the multivariable plant G(s), and 
assume that our task is to use u; to control y;. Bristol argued that there will 
be two extreme cases: 


e Other loops open: All other inputs are constant, ie. uz, =0,Vk F 7. 
e Other loops closed: All other outputs are constant, i.e. y, = 0, Vk F i. 


In the latter case, it is assumed that the other loops are closed with perfect 
control. Perfect control is only possible at steady-state, but it is a good 
approximation at frequencies within the bandwidth of each loop. We now 
evaluate the effect Oy;/Ou; of “our” given input u; on “our” given output y; 
for the two extreme cases. We get 


OY; 
Other loops open: ( ut ) = 9ij (10.45) 
OU) 1, =0,b43 
Other loops closed: (Su ) 2 5ij (10.46) 
OU; } y, 0,04 
Here gj; = [G]ij is the ij’th element of G, whereas Gj; is the inverse of the 


ji’th element of G-! 
G3 = 1/[G 1] (10.47) 
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To derive (10.47) note that 


y=Gu = (34) = [Gs (10.48) 
Ou; Up =O, kAj 
and interchange the roles of G and G1, of u and y, and of i and j to get 
u=Gly = (52) = (6s (10.49) 
YS yx=0,k#i 


and (10.47) follows. Bristol argued that the ratio between the gains in (10.45) 
and (10.46), to corresponding the two extreme cases, is a useful measure of 
interactions, and he introduced the term, ij’th relative gain defined as 


rij SZ eg (eaPalcamalee (10.50) 
ij 
The Relative Gain Array (RGA) is the corresponding matrix of relative gains. 
From (10.50) we get A(G) = Gx(G~')? where x denotes element-by-element 
multiplication (the Schur product). This is identical to our definition of the 
RGA-matrix in (3.65). 

Clearly, we would like to pair variables u; and yj so that A;; is close to 
1, because this means that the gain from uj; to y% is unaffected by closing 
the other loops. On the other hand, a pairing corresponding to ;;(0) < 0 is 
clearly undesirable, because it means that the steady-state gain in “our” given 
loop changes sign when the other loops are closed. A more exact statement is 
given later in Theorem 10.6. 


10.8.2 Stability of decentralized control systems 


Consider a square plant with single-loop controllers. For a 2 x 2 plant there are 
two alternative pairings, a 3 x 3 plant offers 6, a 4x 4 plant 24, and an m xm 
plant has m! alternatives. Thus, tools are needed which are capable of quickly 
eliminating inappropriate control structures. In this section we provide two 
useful rules for pairing inputs and outputs: 


1. To avoid instability caused by interactions in the crossover region one 
should prefer pairings for which the RGA-matrix in this frequency range 
is close to identity. 

2. To avoid instability caused by interactions at low frequencies one should 
avoid pairings with negative steady-state RGA elements. 


Sufficient conditions for stability 


For decentralized diagonal control, it is desirable that the system can be 
tuned and operated one loop at a time. Assume therefore that G is stable 
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and each individual loop is stable by itself (S and T are stable). Then from 
the factorization S$ = $(I + ET)~' in (10.39) and Lemma A.11, which is 
a special case of the generalized Nyquist theorem, it follows that the overall 
system is stable ( is stable) if and only if det(I+ET(s)) does not encircle the 
origin as s traverses the Nyquist D-contour. From the spectral radius stability 
condition in (4.110) we then have that the overall system is stable if 


p(ET (jw)) <1,Vw (10.51) 


This sufficient condition for overall stability can, as discussed by Grosdidier 
and Morari (1986), be used to obtain a number of even weaker stability 
conditions. We will consider three approaches. 


1. We use the fact that p(ET) < ||ET|| < ||E|| ||T'|| for any matrix norm, see 
(A.114). For example, we may use any induced norm, || - ||;,, such as the 
maximum singular value (p = 2), maximum column (p = 1) or maximum 
row sum (p = oo). A sufficient condition for overall stability is then 


IT lip = max |ti| <1/||Ellip (10.52) 


2. A better (less conservative) approach is to split up p(ET) using the 
structured singular value. From (8.100) we have p(ET) < p(E)a(T) and 
from (10.51) we get the following theorem (as first derived by Grosdidier 
and Morari, 1986): 


Theorem 10.2 Assume G is stable and that the individual loops are stable 
(T is stable). Then the entire system is closed-loop stable (T is stable) if 


a(T) = max |ti| <1/u(B) Ww (10.53) 


Here y(E) is the structured singular value interaction measure. It is 
desirable to have u(E) small. Note that u(E) is computed with respect 
to the diagonal structure of T (a diagonal matrix), where we may view T 
as the “design uncertainty”. 


Definition 10.3 A plant is (generalized) diagonally dominant at frequen- 
cies where p(E(jw)) <1. 


From (10.53) we can then allow a(T) > 1 (tight control) at such frequencies 
and still be guaranteed closed-loop stability. 


3. A third approach is to use Gershgorin’s theorem, see page 536. From (10.51) 
we then derive the following sufficient condition for overall stability: 


ltésl <|gisl/ $= loss] Vi, Yeo (10.54) 
i#i 
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or alternatively, in terms of the columns, 


[til < lgisl/ S— loyal Vi, Ww (10.55) 
Ft 

Remark 1 We stress again that condition (10.53) is always less conservative than 
(10.52). This follows since 1/(£) is defined to be the tightest upper bound on a(T) 
which guarantees p(ET) < 1, see also (8.86) with A = T, and since T is diagonal 
so a(T) = Fllep = max; |é];. An implication of this is that (FE) < ||Ellip. Since 
u(E) = p(DED~"), see (8.92), where D in this case is diagonal, it then also follows 
from (A.125) that u(E) < p(|E|), where p(|E|) is the Perron root. (Note that these 
upper bounds on p(F) are not general properties of the structured singular value). 


Remark 2 On the other hand, we cannot say that (10.53) is always less 
conservative than (10.54) and (10.55). It is true that the smallest of the i =1,...m 
upper bounds in (10.54) or (10.55) is always smaller (more restrictive) than 1/u(£) 
in (10.53). However, (10.53) imposes the same bound on [é;| for each loop, whereas 
(10.54) and (10.55) give individual bounds, some of which may be less restrictive 


than 1/p(£). 


Remark 3 Another definition of generalized diagonal dominance is that p(|E|) < 1, 
where p(|E|) is the Perron root, see (A.125). However, since as noted above 
LE) < p(|E)), it is better (less restrictive) to use 4(£) to define diagonal dominance. 


Remark 4 Conditions (10.51)-(10.55) are only sufficient for stability, so we may in 
some cases get closed-loop stability even if they are violated; see also Remark 5 on 
page 469. 


Remark 5 Condition (10.53) and the use of u() for (nominal) stability of the 
decentralized control system can be generalized to include robust stability and robust 
performance, see equations (3la-b) in Skogestad and Morari (1989). 


We now want to show that for closed-loop stability it is desirable to select 
pairings such that the RGA is close to the identity matrix in the crossover 
region. The next simple theorem, which applies to a triangular plant, will 
enable us to do this: 


Theorem 10.4 Suppose the plant G(s) is stable. If the RGA-matriz A(G) = 
I Vw then stability of each of the individual loops implies stability of the entire 
system. 


Proof: From the definition of the RGA it follows that A(G) = J can only arise from a 
triangular G(s) or from G(s)-matrices that can be made triangular by interchanging 
rows and columns in such a way that the diagonal elements remain the same but 
in a different order (the pairings remain the same). A plant with a “triangularized” 
transfer matrix (as described above) controlled by a diagonal controller has only one- 
way coupling and will always yield a stable system provided the individual loops are 
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stable. Mathematically, E = (G — G)G-? can be made triangular, and since the 


diagonal elements of EF are zero, it follows that all eigenvalues of ET are ZeYO, SO 
p(ET) = 0 and (10.51) is satisfied. 


RGA at crossover frequencies. In most cases, it is sufficient for overall 
stability to require that G(jw) is close to triangular (or A(G) & I) at 
crossover frequencies. To see this, assume that S is stable, and that ST(s) = 
SG(s)G(s)~! is stable and has no RHP-zeros (which is always satisfied if both 
G and G are stable and have no RHP-zeros). Then from (10.42) the overall 
system is stable (§ is stable) if and only if (I + S(T — I))~ is stable. Here 
S(L — I) is stable, so from the spectral radius stability condition in (4.110) 
the overall system is stable if 


p(S(P—I)(jw)) <1, Ww (10.56) 


At low frequencies, this condition is usually satisfied because S is small. At 
higher frequencies, where the elements in S= diag{s;} approach and possibly 
exceed 1 in magnitude, (10.56) may be satisfied if G(jw) is close to triangular. 
This is because [ — J and thus Scr — I) are then close to triangular, with 
diagonal elements close to zero, so the eigenvalues of $(I—I)(jw) are close to 
zero, (10.56) is satisfied and we have stability of S. This conclusion also holds 
for plants with RHP-zeros provided they are located beyond the crossover 
frequency range. In summary, we have established the following rule. 


Pairing Rule 1. For stability of diagonal decentralized control select 
pairings for which G(jw) is close to triangular, i.e. A(G(jw)) x I, at 
frequencies around crossover. 


This rule establishes the RGA in the crossover region as a useful tool for 
selecting input-output pairs and below we also establish the usefulness of the 
RGA at steady-state. 


Necessary steady-state conditions for stability 


A desirable property of a decentralized control system is that it has integrity, 
i.e. the closed-loop system should remain stable as subsystem controllers are 
brought in and out of service. Mathematically, the system posesses integrity if 
it remains stable when the controller K is replaced by EK where E = diag{e;} 
and the e; take on the values of 0 or 1, i.e. e; € {0,1}. 

An even stronger requirement is that the system remains stable as the gain 
in various loops are reduced (detuned) by an arbitrary factor, i.e. e; € [0,1]. 
We introduce the idea of decentralized integral controllability (DIC) which 
is concerned with whether it is possible to detune a decentralized controller 
when there is integral action in each loop. 
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Definition 10.5 Decentralized Integral Controllability (DIC). The 
plant G(s) (corresponding to a given pairing) is DIC if there exists a 
decentralized controller with integral action in each loop, such that the 
feedback system is stable and such that each individual loop may be detuned 
independently by a factor «; (0 < 6; < 1) without introducing instability. 


Remark 1 DIC was introduced by Skogestad and Morari (1988b). A detailed 
survey of conditions for DIC and other related properties is given by Campo and 
Morari (1994). 


Remark 2 Unstable plants are not DIC. The reason is that with all «; = 0 we are 
left with the uncontrolled plant G, and the system will be (internally) unstable if 
G(s) is unstable. 


Remark 3 For «¢; = 0 we assume that the integrator of the corresponding SISO 
controller has been removed, otherwise the integrator would yield internal instability. 


Note that DIC considers the existence of a controller, so it depends only on 
the plant G and the chosen pairings. The steady-state RGA provides a very 
useful tool to test for DIC, as is clear from the following result which was first 
proved by Grosdidier, Morari and Holt (1985): 


Theorem 10.6 Steady-state RGA Consider a stable square plant G and 
a diagonal controller K with integral action in all elements, and assume that 
the loop transfer function GK is strictly proper. If a pairing of outputs and 
manipulated inputs corresponds to a negative steady-state relative gain, then 
the closed-loop system has at least one of the following properties: 

(a) The overall closed-loop system is unstable. 

(b) The loop with the negative relative gain is unstable by itself. 

(c) The closed-loop system is unstable if the loop with the negative relative 
gain is opened (broken). 


Proof: The theorem may be proved by setting T=Tin (10.39) and applying the 
generalized Nyquist stability condition. Alternatively, we can use Theorem 6.9 on 
page 261 and select G’ = diag{gi,G"}. Since det G’ = gi detG” and from (A.76 


Mii = gateee we have det G' /detG = \;; and Theorem 10.6 follows. 


Each of the three possible instabilities in Theorem 10.6 resulting from pairing 
on a negative value of 4;;(0) is undesirable. The worst case is (a) when the 
overall system is unstable, but situation (c) is also highly undesirable as it will 
imply instability if the loop with the negative relative gain somehow becomes 
inactive, for example, due to input saturation. Situation (b) is unacceptable if 
the loop in question is intended to be operated by itself e.g. it may be at the 
“bottom” of the control hierarchy, or all the other loops may become inactive, 
due to input saturation. 
Theorem 10.6 can then be summarized in the following rule: 
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Pairing Rule 2. Avoid pairings which correspond to negative values of 
Aix (0). 


Consider a given pairing and assume we have reordered the inputs and outputs 
such that G has the paired elements along its diagonal. Then Theorem 10.6 
implies that 


A (reordered) plant G(s) is DIC only if Au (0) > 0 for all i. (10.57) 


The RGA is a very efficient tool because it does not have to be recomputed 
for each possible choice of pairing. This follows since any permutation of the 
rows and columns of G results in the same permutation in the RGA of G. 
To achieve DIC one has to pair on a positive RGA(0)-element in each row 
and column, and therefore one can often eliminate many alternative pairings 
by a simple glance at the RGA-matrix. This is illustrated by the following 
example. 


Example 10.9 Consider a3 x 3 plant with 


10.2 56 14 0.96 1.45 —1.41 
G0) =| 15.5 -8.4 -0.7], A(0)=] 094 —0.37 0.43 (10.58) 
181 04 18 —0.90 —0.07 1.98 


For a3 x3 plant there are 6 alternative pairings, but from the steady-state RGA 
we see that there is only one positive element in column 2 (\i2 = 1.45), and only 
one positive element in row 3 (A33 = 1.98), and therefore there is only one possible 
pairing with all RGA-elements positive (uw Yy2, U2 yi, us 93). Thus, if we 
require DIC we can from a quick glance at the steady-state RGA eliminate five of 
the siz alternative pairings. 


Exercise 10.7 Assume that the 4x 4 matriz in (A.80) represents the steady-state 
model of a plant. Show that 20 of the 24 possible pairings can be eliminated by 
requiring DIC. 

Remarks on DIC and RGA. 


1. For 2 x 2 and 3 x 3 plants we have even tighter conditions for DIC than (10.57). 
For 2 x 2 plants (Skogestad and Morari, 1988b) 


DIC & An(0)>0 (10.59) 


For 3 x 3 plants with positive diagonal RGA-elements of G(0) and of G"'(0),i = 
1,3 (its three principal submatrices) we have (Yu and Fan, 1990) 


DIC &  Vrr1(0) + V/22(0) + V/Aa3(0) > 1 (10.60) 


(Strictly speaking, as pointed out by Campo and Morari (1994), we do not have 


equivalence for the case when J A11(0) + of d22(0) + /A33(0) is identical to 1, 
but this has little practical significance). 
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2. One cannot expect tight conditions for DIC in terms of the RGA for 4x4 systems 
or higher. The reason is that the RGA essentially only considers “corner values” , 
e; = 0 or & = 1 (integrity), for the detuning factor in each loop in the definition 


of DIC. This is clear from the fact that \;; = wider’ | where G corresponds 
to «; = 1 for all i, gi; corresponds to «; = 1 with the other «, = 0, and G” 
corresponds to e; = 0 with the other e; = 1. 

3. Determinant conditions for integrity (DIC). The following condition is 
concerned with whether it is possible to design a decentralized controller for the 
plant such that the system posesses integrity, which is a prerequisite for having 
DIC: 

Assume without loss of generality that the signs of the rows or columns of G have 
been adjusted such that all diagonal elements of G are positive, i.e. gi(0) > 0. 
Then one may compute the determinant of G(0) and all its principal submatrices 
(obtained by deleting rows and corresponding columns in G(0)), which should all 
have the same sign for DIC. 

This determinant condition follows by applying Theorem 6.9 to all possible 
combinations of «; = 0 or 1 as illustrated in the proof of Theorem 10.6, and 
is equivalent to requiring that the so-called Niederlinski indices, 


Ny = det G(0)/Tligii (0) (10.61) 


of G(0) and its principal submatrices are all positive. Actually, this yields more 

information than the RGA, because in the RGA the terms are combined into 

Ai = gui der" so we may have cases where two negative determinants result 
in a positive RGA-element. Nevertheless, the RGA is usually the preferred tool 
because it does not have to be recomputed for each pairing. 

4. DIC is also closely related to D-stability, see papers by Yu and Fan (1990) and 
Campo and Morari (1994). The theory of D-stability provides necessary and 
sufficient conditions except in a few special cases, such as when the determinant 
of one or more of the submatrices is zero. 

5. If we assume that the controllers have integral action, then T(0) = J, and we can 
derive from (10.53) that a sufficient condition for DIC is that G is generalized 
diagonally dominant at steady-state, that is, 


u(E(0)) <1 


This is proved by Braatz (1993, p.154). However, the requirement is only sufficient 
for DIC and therefore cannot be used to eliminate designs. Specifically, for a 
2 x 2 system it is easy to show (Grosdidier and Morari, 1986) that u(E(0)) < 1 
is equivalent to A11(0) > 0.5, which is conservative when compared with the 
necessary and sufficient condition 11(0) > 0 in (10.59). 

6. If the plant has jw-axis poles, e.g. integrators, it is recommended that, prior 
to the RGA-analysis, these are moved slightly into the LHP (e.g. by using very 
low-gain feedback). This will have no practical significance for the subsequent 
analysis. 

7. Since Theorem 6.9 applies to unstable plants, we may also easily extend Theorem 
10.6 to unstable plants (and in this case one may actually desire to pair 
on a negative RGA-element). This is shown in Hovd and Skogestad (1994a). 
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Alternatively, one may first implement a stabilizing controller and then analyze 
the partially controlled system as if it were the plant G(s). 

8. The above results only address stability. Performance is analyzed in Section 
10.8.4. 


10.8.3 The RGA and right-half plane zeros 


Bristol (1966) claimed that negative values of \;;(0) implied the presence of 
RHP-zeros. This is not quite true, and the correct statement is (Hovd and 
Skogestad, 1992): 


Theorem 10.7 Consider a transfer function matrix with stable elements and 
no zeros or poles at s = 0. Assume lims_,oo Ai; (8) is finite and different 
from zero. If rij(joo) and 4; (0) have different signs then at least one of 
the following must be true: 

a) The element g;;(s) has a RHP-zero. 

b) The overall plant G(s) has a RHP-zero. 

c) The subsystem with input j and output i removed, GY) (s), has a RHP-zero. 


Any such zero may be detrimental for decentralized control. In most cases 
the pairings are chosen such that ;(00) is positive (usually close to 1, see 
Pairing rule 1) which then confirm, Bristol’s claim that a negative 2,;(0) 
implies there is a RHP-zero in some subsystem. 


Example 10.10 Consider a plant 


a s+1 5+4 
a) = As ( 1 9 ) (10.62) 


We find that \11(co) = 2 and A11(0) = —1 have different signs. Since none of the 
diagonal elements have RHP-zeros we conclude from Theorem 10.7 that G(s) must 
have a RHP-zero. This is indeed true and G(s) has a zero at s = 2. 


10.8.4 Performance of decentralized control systems 
In the following, we consider performance in terms of the control error 
e=y—-r=Gu+Gad—r (10.63) 


Suppose the system has been scaled as outlined in Chapter 1.4, such that at 
each frequency: 


1. Each disturbance is less than 1 in magnitude, |d,| < 1. 
2. Each reference change is less than the corresponding diagonal element in 
R, Ir;| < R;. 
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3. For each output the acceptable control error is less than 1, |e;| < 1. 


For SISO systems, we found in Section 5.10 that in terms of scaled variables 
we must at all frequencies require 


[1+L|>|Gal and |14+ Z| > |R| (10.64) 


for acceptable disturbance rejection and command tracking, respectively. Note 
that L,Gq and R are all scalars in this case. For decentralized control these 
requirements may be directly generalized by introducing the PRGA-matrix, 
T = GG", in (10.43) and the CLDG-matrix, Gy = TG, in (10.44). These 
generalizations will be presented and discussed next, and then subsequently 
proved. 

Single disturbance. Consider a single disturbance, in which case Gq is 
a vector, and let gg; denote the i’th element of Gg. Let L; = gi:k; denote 
the loop transfer function in loop 7. Consider frequencies where feedback is 
effective so ST is small (and (10.67) is valid). Then for acceptable disturbance 
rejection (|e;| < 1) we must with decentralized control require for each loop #, 


[1 + D;| > [Gas Vi (10.65) 


which is the same as the SISO-condition (5.55) except that Gg is replaced by 
the CLDG, ga;. In words, ga; gives the “apparent” disturbance gain as seen 
from loop i when the system is controlled using decentralized control. 

Single reference change. Similarly, consider a change in reference for 
output j of magnitude R;. Consider frequencies where feedback is effective 
(and (10.67) is valid). Then for acceptable reference tracking (|e;| < 1) we 
must require for each loop 7 


b+ Lil > big) IRyl Vi (10.66) 


which is the same as the SISO-condition except for the PRGA-factor, |y;|. In 
other words, when the other loops are closed the response in loop 7 gets slower 
by a factor |7;;|. Consequently, for performance it is desirable to have small 
elements in I, at least at frequencies where feedback is effective. However, 
at frequencies close to crossover, stability is the main issue, and since the 
diagonal elements of the PRGA and RGA are equal. we usually prefer to have 
i Close to 1 (recall pairing rule 1 on page 466). 


Proof of (10.65) and (10.66): At frequencies where feedback is effective, S is small, 
so 
1480-NDeI (10.67) 
and from (10.42) we have 
Sx Sr (10.68) 
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The closed-loop response then becomes 
e = SGad — Sr & SGad — STr (10.69) 


and the response in output 7 to a single disturbance d; and a single reference change 
r; is 


C4 © SiGaikdk — SiVirTh (10.70) 
where 3; = 1/(1 + gik;) is the sensitivity function for loop i by itself. Thus, to 
achieve |e;| < 1 for |d,| = 1 we must require |S;gaiz| < 1 and (10.65) follows. 
Similarly, to achieve |e;| < 1 for |r;| = |R;| we must require |s;yinRj| < 1 and 


(10.66) follows. Also note that |s;7;;| < 1 will imply that assumption (10.67) is 
valid. Since R usually has all of its elements larger than 1, in most cases (10.67 
will be automatically satisfied if (10.66) is satisfied, so we normally need not check 
assumption (10.67). 


Remark 1 (10.68) may also be derived from (10.40) by assuming T x I which 
yields (I+ ET)~' & (1+ EE)! =P. 


Remark 2 Consider a particular disturbance with model gq. Its effect on output 7 
with no control is ga:, and the ratio between Gai (the CLDG) and ga; is the relative 
disturbance gain (RDG) ((;) of Stanley, Marino-Galarraga and McAvoy (1985) (see 
also Skogestad and Morari (1987b)): 


6: © Gai/gai = [EG~ ga]: [gals (10.71) 


Thus (;, which is scaling independent, gives the change in the effect of the 
disturbance caused by decentralized control. It is desirable to have @; small, as 
this means that the interactions are such that they reduce the apparent effect of the 
disturbance, such that one does not need high gains |L;| in the individual loops. 


10.8.5 Summary: Controllability analysis for 
decentralized control 


When considering decentralized diagonal control of a plant, one should first 
check that the plant is controllable with any controller. If the plant is unstable, 
then as usual the unstable modes must be controllable and observable. In 
addition, the unstable modes must not be decentralized fired modes, otherwise 
the plant cannot be stabilized with a diagonal controller (Lunze, 1992). For 
example, this is the case for a triangular plant if the unstable mode appears 
only in the off-diagonal elements. 

The next step is to compute the RGA-matrix as a function of frequency, 
and to determine if one can find a good set of input-output pairs bearing in 
mind the following 


1. Prefer pairings which have the RGA-matrix close to identity at frequencies 
around crossover, i.e. the RGA-number ||A(jw) — || should be small. This 
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rule is to ensure that interactions from other loops do not cause instability 
as discussed following (10.56). 
2. Avoid a pairing ij with negative steady-state RGA elements, \,;(G(0)). 
3. Prefer a pairing ij where g;; puts minimal restrictions on the achievable 
bandwidth. Specifically, the frequency wy; where Zgij(jwusj) = —180° 
should be as large as possible. 
This rule favours pairing on variables “close to each other”, which makes 
it easier to satisfy (10.65) and (10.66) physically while at the same time 
achieving stability. It is also consistent with the desire that A(jw) is close 
to I. 


When a reasonable choice of pairings has been made, one should rearrange G 
to have the paired elements along the diagonal and perform a controllability 
analysis. 


4. Compute the CLDG and PRGA, and plot these as a function of frequency. 

5. For systems with many loops, it is best to perform the analysis one loop at 
the time, that is, for each loop 2, plot |Gaix| for each disturbance k and plot 
ly«j| for each reference j (assuming here for simplicity that each reference 
is of unit magnitude). For performance, we need |1 + L;| to be larger than 
each of these 


Performance: |1+ L;|> max{|Gaik|, Ilyas |} (10.72) 
J 


To achieve stability of the individual loops one must analyze g;;(s) to ensure 
that the bandwidth required by (10.72) is achievable. Note that RHP- 
zeros in the diagonal elements may limit achievable decentralized control, 
whereas they may not pose any problems for a multivariable controller. 
Since with decentralized control we usually want to use simple controllers, 
the achievable bandwidth in each loop will be limited by the frequency 
where /giz is —180° (recall Chapter 5.12). 

6. As already mentioned one may check for constraints by considering 
the elements of G~'Gg and making sure that they do not exceed 
one in magnitude within the frequency range where control is needed. 
Equivalently, one may for each loop i plot |g;:|, and the requirement is 
then that 


To avoid input constraints: |gi| > |Gaie|, Vk (10.73) 


at frequencies where |ga;,| is larger than 1 (this follows since Ga = 
(elemrery This provides a direct generalization of the requirement |G| > 
|Ga| for SISO systems. The advantage of (10.73) compared to using G-1G4 
is that we can limit ourselves to frequencies where control is needed to reject 
the disturbance (where |Guiz| > 1). 
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If the plant is not controllable, then one may consider another choice of 
pairings and go back to Step 4. If one still cannot find any pairings which are 
controllable, then one should consider multivariable control. 


7. If the chosen pairing és controllable then the analysis based on (10.72) tells 
us directly how large |L;| = |g;;k;| must be, and can be used as a basis for 
designing the controller k;(s) for loop i. 


Remark. In some cases, pairings which violate the above rules may be chosen. 
For example, one may even choose to pair on elements with gi; = 0 which yield 
ii = 0. One then relies on the interactions to achieve the desired performance as 
loop 7 by itself has no effect. An example of this is in distillation control when the 
LV -configuration is not used, see Example 10.6. 


Example 10.11 Application to distillation process. In order to 
demonstrate the use of the frequency dependent RGA and CLDG for evaluation 
of expected diagonal control performance, we consider again the distillation process 
used in Example 10.8. The LV configuration is used, that is, the manipulated inputs 
are refluz L (ui) and boilup V (uz). Outputs are the product compositions yp (y1) 
and xp (y2). Disturbances in feed flowrate F (d) and feed composition zr (d2), are 
included in the model. The disturbances and outputs have been scaled such that a 
magnitude of 1 corresponds to a change in F of 20%, a change in zr of 20%, and 
a change in xp and yn of 0.01 mole fraction units. The 5-state dynamic model is 
given in Section 12.4. 

Initial controllability analysis. G(s) is stable and has no RHP-zeros. The 
plant and RGA-matrix at steady-state are 


87.8 Amel i= | 2 a 


0) =| Ne —109.6 —34.1 35.1 ON) 


The RGA-elements are much larger than 1 and indicate a plant that is fundamentally 
difficult to control. Fortunately, the flow dynamics partially decouple the response 
at higher frequencies, and we find that A(jw) = I at frequencies above about 0.5 
rad/min. Therefore if we can achieve sufficiently fast control, the large steady-state 
RGA-elements may be less of a problem. 

The steady-state effect of the two disturbances is given by 


7.88 8.81 ] (10.75) 


Ga(0) = Ke 11.19 


and the magnitudes of the elements in Ga(jw) are plotted as a function of frequency 
in Figure 10.9. From this plot the two disturbances seem to be equally difficult to 
reject with magnitudes larger than 1 up to a frequency of about 0.1 rad/min. We 
conclude that control is needed up to 0.1 rad/min. The magnitude of the elements 
in G~'Ga(jw) (not shown) are all less than 1 at all frequencies (at least up to 10 
rad/min), and so it will be assumed that input constraints pose no problem. 

Choice of pairings. The selection of ui; to control y, and uz to control 
y2, corresponds to pairing on positive elements of A(O) and A(jw) & TI at high 
frequencies. This seems sensible, and is used in the following. 
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Figure 10.9: Disturbance gains |gaix|, for effect of disturbance k on output #. 
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Figure 10.10: Closed-loop disturbance gains, |gaix|, for effect of disturbance k on 
output 7. 
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Figure 10.11: PRGA-elements |7;| for effect of reference j on output 7. 
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Figure 10.12: Decentralized PI-control. Responses to a unit step in d; at t = 0 and 
a unit step in dz at t = 50 min. 


Analysis of decentralized control. The elements in the CLDG and PRGA 
matrices are shown as functions of frequency in Figures 10.10 and 10.11. At steady- 
state we have 


roy=[ 2, 28]. dan=norcuin= [327 28] coz 


In this particular case the off-diagonal elements of RGA (A) and PRGA (T) are quite 
similar. We note that Ga(0) is very different from Ga(0), and this also holds at higher 
frequencies. For disturbance 1 (first column in Ga) we find that the interactions 
increase the apparent effect of the disturbance, whereas they reduce the effect of 
disturbance 2, at least on output 1. 

We now consider one loop at a time to find the required bandwidth. For loop 1 
(output 1) we consider 711 and y12 for references, and Qai1 and Ga12 for disturbances. 
Disturbance 1 is the most difficult, and we need |1 + 11| > |gaii| at frequencies 
where |gaii| is larger than 1, which is up to about 0.2 rad/min. The magnitude of 
the PRGA-elements are somewhat smaller than |gaii| (at least at low frequencies), 
so reference tracking will be achieved if we can reject disturbance 1. From ga12 we 
see that disturbance 2 has almost no effect on output 1 under feedback control. 

Also, for loop 2 we find that disturbance 1 is the most difficult, and from Gai2 we 
require a loop gain larger than 1 up to about 0.3 rad/min. A bandwidth of about 0.2 
to 0.3 rad/min in each loop, is required for rejecting disturbance 1, and should be 
achievable in practice. 

Observed control performance. To check the validity of the above results we 
designed two single-loop PI controllers: 
1+3.76s | 143.31s 


k =-0. 10. 
3.768) 2(s) 0.375 331s (10.77) 


k1(s) = 0.261 


The loop gains, Li = giiki, with these controllers are larger than the closed-loop 
disturbance gains, |6;,|, at frequencies up to crossover. Closed-loop simulations with 
these controllers are shown in Figure 10.12. The simulations confirm that disturbance 
2 is more easily rejected than disturbance 1. 
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In summary, there is an excellent agreement between the controllability 
analysis and the simulations, as has also been confirmed by a number of other 
examples. 


10.8.6 Sequential design of decentralized controllers 


The results presented in this section on decentralized control are most useful 
for the case when the local controllers k;(s) are designed independently, that 
is, each controller is designed locally and then all the loops are closed. As 
discussed above, one problem with this is that the interactions may cause the 
overall system (T) to be unstable, even though the local loops (T’) are stable. 
This will not happen if the plant is diagonally dominant, such that we satisfy, 
for example, (1) < 1/p(E) in (10.53). 

The stability problem is avoided if the controllers are designed sequentially 
as is commonly done in practice when, for example, the bandwidths of the 
loops are quite different. In this case the outer loops are tuned with the inner 
(fast) loops in place, and each step may be considered as a SISO control 
problem. In particular, overall stability is determined by m SISO stability 
conditions. However, the issue of performance is more complicated because 
the closing of a loop may cause “disturbances” (interactions) into a previously 
designed loop. The engineer must then go back and redesign a loop that has 
been designed earlier. Thus sequential design may involve many iterations, 
also see Hovd and Skogestad (1994b). The performance bounds in (10.72) are 
useful for determining the required bandwidth in each loop and may thus 
suggest a suitable sequence in which to design the controllers. 

Although the analysis and derivations given in this section apply when we 
design the controllers sequentially, it is often useful, after having designed a 
lower-layer controller (the inner fast loops), to redo the analysis based on the 
model of the partially controlled system using (10.30) or (10.32). For example, 
this is usually done for distillation columns, where we base the analysis of the 
composition control problem on a 2 x 2 model of the partially controlled 5 x 5 
plant, see Examples 10.6 and 10.11. 


10.8.7 Conclusion on decentralized control 


In this section, we have derived a number of conditions for the stability, e.g. 
(10.53) and (10.57), and performance, e.g. (10.65) and (10.66), of decentralized 
control systems. The conditions may be useful in determining appropriate 
pairing of inputs and outputs and the sequence in which the decentralized 
controllers should be designed. Recall, however, that in many practical cases 
decentralized controllers are tuned based on local models or even on-line. The 
conditions/bounds are also useful in an input-output controllability analysis 
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for determining the viability of decentralized control. 
Some exercises which include a controllability analysis of decentralized 
control are given at the end of Chapter 6. 


10.9 Conclusion 


The issue of control strcture design is very important in applications, but it 
has received relatively little attention in the control community during the 
last 40 years. In this chapter we have discussed the issues involved, and we 
have provided some ideas and tools. There is clearly a need for better tools 
and theory in this area. 


11 


MODEL REDUCTION 


This chapter describes methods for reducing the order of a plant or controller model. 

We place considerable emphasis on reduced order models obtained by residualizing 
the less controllable and observable states of a balanced realization. We also 
present the more familiar methods of balanced truncation and optimal Hankel norm 
approximation. 


11.1 Introduction 


Modern controller design methods such as H.. and LQG, produce controllers 
of order at least equal to that of the plant, and usually higher because of 
the inclusion of weights. These control laws may be too complex with regards 
to practical implementation and simpler designs are then sought. For this 
purpose, one can either reduce the order of the plant model prior to controller 
design, or reduce the controller in the final stage, or both. 

The central problem we address is: given a high-order linear time-invariant 
stable model G, find a low-order approximation G, such that the infinity (Hoo 
or £4.) norm of the difference, ||G —Ga|loo, is small. By model order, we mean 
the dimension of the state vector in a minimal realization which is sometimes 
called the McMillan degree. 

So far in this book we have only been interested in the infinity (Hoo) norm 
of stable systems. But the error G — G, may be unstable and the infinity 
norm needs to be extended to unstable systems. £., defines the set of rational 
functions which have no poles on the imaginary axis, it includes H.., and its 
norm (like H.) is given by ||G||..0 = sup,, ¢ (G(jw)). 

We will describe three main methods for tackling this problem: balanced 
truncation, balanced residualization and optimal Hankel norm approximation. 
Each method gives a stable approximation and a guaranteed bound on the 
error in the approximation. We will further show how the methods can be 
employed to reduce the order of an unstable model G. All these methods 
start from a special state-space realization of G referred to as balanced. We 
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will describe this realization, but first we will show how the techniques of 
truncation and residualization can be used to remove the high frequency or 
fast modes of a state-space realization. 


11.2 Truncation and residualization 


Let (A,B,C,D) be a minimal realization of a stable system G(s), and 


Ly 


partition the state vector x, of dimension n, into where 22 is the vector 


of n — k states which we wish to remove. With appropriate partitioning of A, 
B and C, the state-space equations become 


Ly = Ay Ajo% Byu (11.1) 
bg = Apia, + Aoot%e + Bou 
y= C21 te Cox a Du 


11.2.1. Truncation 


A k-th order truncation of the realization G = (A,B,C,D) is given by 
Gee (Ay1, Bi,Ci,D). The truncated model G, is equal to G at infinite 
frequency, G'(co) = G,(co) = D, but apart from this there is little that can 
be said in the general case about the relationship between G and G4. If, 
however, A is in Jordan form then it is easy to order the states so that x2 
corresponds to high frequency or fast modes. This is discussed next. 

Modal Truncation. For simplicity, assume that A has been diagonalized 
so that 


Mm 0 = 0 br 
BAe tte 0 of 

A= 7 ‘ : 7 B= , C= [ey Cl ct Cn | (11.2) 
0 O -+ Xn or 


n 


Then, if the A; are ordered so that |Ai| < |A2| < ---, the fastest modes are 
removed from the model after truncation. The difference between G and G, 
following a k-th order model truncation is given by 


6262 5 Se (11.3) 
aaa s—; ; 
i=k4+1 
and therefore 
a a(cibT) 
|G — Galloo < a (11.4) 
2 |Re(A;)| 
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It is interesting to note that the error depends on the residues c,;b] as well 
as the \;. The distance of \; from the imaginary axis above is therefore not 
a reliable indicator of whether the associated mode should be included in the 
reduced order model or not. 

An advantage of modal truncation is that the poles of the truncated model 
are a subset of the poles of the original model and therefore retain any physical 
interpretation they might have e.g. the phugoid mode in aircraft dynamics. 


11.2.2 Residualization 


In truncation, we discard all the states and dynamics associated with 22. 
Suppose that instead of this we simply set #2 = 0, i.e. we residualize xo, in 
the state-space equations. One can then solve for x in terms of 7; and u and 
back substitution of rz, then gives 


£41 = (Aut a Aya A55 Aoi) 21 ae (By = Aj2A55 Bo)u (11.5) 
y= (C1 = Cz A55 Ani) 21 + (D == C2 A355 Bo)u (11.6) 


Let us assume Age is invertible and define 


Ay, SS Ais Ay (11.7) 
Be & B= AASB; (11.8) 
Ce = Cpa CA Ay (11.9) 
De CAs Bs, (11.10) 


The reduced order model Ga(s) = (A,;, Br, Cr, Dr) is called a residualization 
of G(s) = (A,B,C,D). Usually (A,B,C,D) will have been put into 
Jordan form, with the eigenvalues ordered so that x contains the fast 
modes. Model reduction by residualization is then equivalent to singular 
perturbational approximation, where the derivatives of the fastest states are 
allowed to approach zero with some parameter «. An important property 
of residualization is that it preserves the steady-state gain of the system, 
G,(0) = G(0). This should be no surprise since the residualization process 
sets derivatives to zero, which are zero anyway at steady-state. But it is in 
stark contrast to truncation which retains the system behaviour at infinite 
frequency. This contrast between truncation and residualization follows from 
the simple bilinear relationship s > L which relates the two (e.g. Liu and 
Anderson, 1989). 

It is clear from the discussion above that truncation is to be preferred when 
accuracy is required at high frequencies, whereas residualization is better for 
low frequency modelling. 
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Both methods depend to a large extent on the original realization and we 
have suggested the use of the Jordan form. A better realization, with many 
useful properties is the balanced realization which will be considered next. 


11.3. Balanced realizations 


In words only: A balanced realization is an asymptotically stable minimal 
realization in which the controllability and observability Gramians are equal 
and diagonal. 

More formally: Let (A, B, C, D) be a minimal realization of a stable, rational 
transfer function G(s), then (A, B,C, D) is called balanced if the solutions to 
the following Lyapunov equations 


AP+PA™+BB™ = 0 (11.11) 
ATQ+QA+CTC = 0 (11.12) 


are P = Q = diag(a1,02,..-,0n) S S, where 0, > 02 >... >0n >0. P and 
Q are the controllability and observability Gramians, also defined by 


I> 


P | ec“! BBT eA "tat (11.13) 


0 


I> 


Q 7 At CT CeAtdt (11.14) 


0 
¥ is therefore simply referred to as the Gramian of G(s). The oj are the ordered 


Hankel singular values of G(s), more generally defined as o; = d? (PQ), i = 
1,...,n. Notice that 0, = ||G||#, the Hankel norm of G(s). 

Any minimal realization of a stable transfer function can be balanced by a 
simple state similarity transformation, and routines for doing this are available 
in MATLAB. For further details on computing balanced realizations, see Laub, 
Heath, Page and Ward (1987). Note that balancing does not depend on D. 

So what is so special about a balanced realization? In a balanced realization 
the value of each o; is associated with a state x; of the balanced system. And 
the size of o; is a relative measure of the contribution x; that makes to the 
input-output behaviour of the system, also see the discussion on page 167. 
Therefore if a; >> oc», then the state 7, affects the input-output behaviour 
much more than #2, or indeed any other state because of the ordering of the o;. 
After balancing a system, each state is just as controllable as it is observable, 
and a measure of a state’s joint observability and controllability is given by its 
associated Hankel singular value. This property is fundamental to the model 
reduction methods in the remainder of this chapter which work by removing 
states having little effect on the system’s input-output behaviour. 
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11.4 Balanced truncation and balanced 
residualization 


Let the balanced realization (A,B,C, D) of G(s) and the corresponding © be 
partitioned compatibly as 


Ai At By 
A= B= C=(C% C 11.15 
ee eae LB | Eee sa) Ve 
fe 0 
=| ; =| (11.16) 
where ©y = diag(o1,02,...,0%), No = diag(ony1,0442,---;0n) and op, > 


Ok+1- 

Balanced truncation. The reduced order model given by (A11, Bi, C1, D) 
is called a balanced truncation of the full order system G(s). This idea of 
balancing the system and then discarding the states corresponding to small 
Hankel singular values was first introduced by Moore (1981). A balanced 
truncation is also a balanced realization (Pernebo and Silverman, 1982), 
and the infinity norm of the error between G(s) and the reduced order 
system is bounded by twice the sum of the last n — k Hankel singular 
values, i.e., twice the trace of 2g or simply “twice the sum of the tail” 
(Glover, 1984; Enns, 1984). For the case of repeated Hankel singular values, 
Glover (1984) shows that each repeated Hankel singular value is to be counted 
only once in calculating the sum. 

A precise statement of the bound on the approximation error is given in 
Theorem 11.1 below. 

Useful algorithms that compute balanced truncations without first 
computing a balanced realization have been developed by Tombs and 
Postlethwaite (1987) and Safonov and Chiang (1989). These still require the 
computation of the observability and controllability Gramians which can be 
a problem if the system to be reduced is of very high order. In such cases the 
technique of Jaimoukha, Kasenally and Limebeer (1992), based on computing 
approximate solutions to Lyapunov equations, is recommended. 

Balanced residualization. In balanced truncation above, we discarded 
the least controllable and observable states corresponding to Uy. In balanced 
residualization, we simply set to zero the derivatives of all these states. The 
method was introduced by Fernando and Nicholson (1982) who called it a 
singular perturbational approximation of a balanced system. The resulting 
balanced residualization of G(s) is (Ar, Br,C,,D,) as given by the formulae 
(11.7)-(11.10). 

Liu and Anderson (1989) have shown that balanced residualization enjoys 
the same error bound as balanced truncation. An alternative derivation of 
the error bound, more in the style of Glover (1984), is given by Samar, 
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Postlethwaite and Gu (1995). A precise statement of the error bound is given 
in the following theorem. 


Theorem 11.1 Let G(s) be a stable rational transfer function with Hankel 
singular values 0, > 02 >... > on where each o; has multiplicity r; and 
let G*(s) be obtained by truncating or residualizing the balanced realization of 
G(s) to the first (r) +ro+...+1,) states. Then 


G(s) — GED lloo < 2(oes1 + ona +... ton): (11.17) 


The following two exercises are to emphasize that (i) balanced truncation 
preserves the steady state-gain of the system and (ii) balanced residualization 
is related to balanced truncation by the bilinear transformation s + s~'. 
Exercise 11.1 The steady-state gain of a full order balanced system (A, B,C, D) 
is D—-CA™'B. Show, by algebraic manipulation, that this is also equal to Dy — 
C,A;'B,, the steady-state gain of the balanced residualization given by (11.7)- 
(11.10). 


Exercise 11.2 Let G(s) have a balanced realization [at]. then 


At |  A'B 
-CAT|D-—CA'B 


is a balanced realization of H(s) & G(s~'), and the Gramians of the two realizations 

are the same. 

1. Write down an expression for a balanced truncation H;(s) of H(s). 

2. Apply the reverse transformation s-' — 8 to Hi(s), and hence show that 
G,(s) & Hi(s~') is a balanced residualization of G(s) as defined by (11.7)- 
(11.10). 


11.5 Optimal Hankel norm approximation 


In this approach to model reduction, the problem that is directly addressed 
is the following: given a stable model G(s) of order (McMillan degree) n, find 
a reduced order model G(s) of degree k such that the Hankel norm of the 
approximation error, ||G(s) — G*(s)||s, is minimized. 

The Hankel norm of any stable transfer function E(s) is defined as 


E(s)|la = p? (PQ) (11.18) 


where P and Q are the controllability and observability Gramians of E(s). It 
is also the maximum Hankel singular value of E(s). So in the optimization 
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we seek an error which is in some sense closest to being completely 
unobservable and completely uncontrollable, which seems sensible. A more 
detailed discussion of the Hankel norm was given in Section 4.10.4. 

The Hankel norm approximation problem has been considered by many 
but especially Glover (1984). In Glover (1984) a complete treatment of the 
problem is given, including a closed-form optimal solution and a bound on 
the infinity norm of the approximation error. The infinity norm bound is of 
particular interest because it is better than that for balanced truncation and 
residualization. 

The theorem below gives a particular construction for optimal Hankel norm 
approximations of square stable transfer functions. 


Theorem 11.2 Let G(s) be a stable, square, transfer function G(s) with 
Hankel singular values 0, > 0g > +++ > Op > Opgy = Oegg =*°* = Ong > 
Ohtl41 2+ > On > 0, then an optimal Hankel norm approximation of order 
k, Gk(s), can be constructed as follows. 

Let (A, B,C,D) be a balanced realization of G(s) with the Hankel singular 
values reordered so that the Gramian 


yo= diag (01,02,°** Oh, ORAL, Ons Th415 °°, Ok41) (11.19) 


A : 
= diag (“1, 0n411) 


Partition (A, B,C,D) to conform with ¥: 


= Ai Aig a By Ze 
a=(4" | Pele | C=[ CO] (11.20) 


A [7 (07 AF, + 1 AuE1 — 0C7UBT) (11.21 
8-211) (R, $eCrU) (11.22 
CG 2 GS, +ouBT (11.23 
D = Do (11.24 
eae ae (11.25 
where U is a unitary matrix satisfying 
By = —CfU (11.26) 


and 
pey?-@ (11.27) 


486 MULTIVARIABLE FEEDBACK CONTROL 


Gk(s) + F(s) 2 Ale | (11.28) 


where Gk (s) is a stable optimal Hankel norm approximation of order k, and 
F(s) is an anti-stable (all poles in the open right-half plane) transfer function 
of ordern —k—1. The Hankel norm of the error between G and the optimal 
approximation Gt is equal to the (k +1) th Hankel singular value of G: 


Then 


|G — Gill = on41(G) (11.29) 


Remark 1 The McMillan degrees of Gi and F depend on the o chosen. If ¢ = on, 
the smallest Hankel singular value, then F' = 0, otherwise (A, B,C, D) has a non- 
zero anti-stable part and G* has to be separated from F. 


Remark 2 For non-square systems, an optimal Hankel norm approximation can be 
obtained by first augmenting G(s) with zero to form a square system. For example, 


if G(s) is flat, define G(s) 2 ee which is square, and let Ga(s) = fae bea 
k-th order optimal Hankel norm approximation of G(s) such that ||G(s)—G,(s) lz = 
COk+1 (G(s)). Then 


or41 (G(s)) < [IG — Galle < [|G — Galle = on41(G@) = on41(@) 


Consequently, this implies that ||G — Gillw = on41(G) and Gi(s) is an optimal 
Hankel norm approximation of G(s). 


Remark 3 The Hankel norm of a system does not depend on the D-matrix in 
the system’s state-space realization. The choice of the D-matrix in Gi is therefore 
arbitrary except when F' = 0, in which case it is equal to D. 


Remark 4 The infinity norm does depend on the D-matrix, and therefore the D- 
matrix of G* can be chosen to reduce the infinity norm of the approximation error 
(without changing the Hankel norm). Glover (1984) showed that through a particular 
choice of D, called D,, the following bound could be obtained: 


|G — Gk — Dolloo < on41 +6 (11.30) 


where 


n—-k-l 


n—k-l 
5 2 ||F-Dollo < S> ai (F(-s)) < So oi4n4s (Gls) (11.31) 


i=1 


This results in an infinity norm bound on the approximation error equal to the 
“sum of the tail”, namely o74.1 + 044141 +::: +n. Notice that the repeated Hankel 
singular value o;41 is only included once. Recall that the bound for the error in 
balanced truncation and balanced residualization is twice the “sum of the tail”. 
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11.6 Two practical examples 


In this section, we make comparisons between the three main model reduction 
techniques presented by applying them to two practical examples. The first 
example is on the reduction of a plant model and the second considers the 
reduction of a two degrees-of-freedom controller. 


11.6.1 Reduction of a gas turbine aero-engine model 


For the first example, we consider the reduction of a model of a Rolls Royce 
Spey gas turbine engine. This engine will be considered again in Chapter 
12. The model has 3 inputs, 3 outputs, and 15 states. Inputs to the engine 
are fuel flow, variable nozzle area and an inlet guide vane with a variable 
angle setting. The outputs to be controlled are the high pressure compressor’s 
spool speed, the ratio of the high pressure compressor’s outlet pressure to 
engine inlet pressure, and the low pressure compressor’s exit mach number 
measurement. The model describes the engine at 87% of maximum thrust with 
sea-level static conditions. The Hankel singular values for the 15 state model 
are listed in Table 11.1 below. Recall that the £2, error bounds after reduction 
are “twice the sum of the tail” for balanced residualization and balanced 
truncation and the “sum of the tail” for optimal Hankel norm approximation. 
Based on this we decided to reduce the model to 6 states. 


1) 2.0005e+01 | 6) 6.2964e-01 | 11) 1.3621¢-02 
2) 4.0464e+00 | 7) 1.6689e-01 | 12) 3.9967e-03 
3) 2.7546e+00 | 8) 9.3407e-02 | 13) 1.1789e-03 
4) 1.7635e+00 | 9) 2.2193e-02 | 14) 3.2410e-04 
5) 1.2965e+00 | 10) 1.5669e-02 | 15) 3.3073e-05 


Table 11.1: Hankel singular values of the gas turbine aero-engine model. 


Figure 11.1 shows the singular values (not Hankel singular values) of the 
reduced and full order models plotted against frequency for the residualized, 
truncated and optimal Hankel norm approximated cases respectively. The D 
matrix used for optimal Hankel norm approximation is such that the error 
bound given in (11.30) is met. It can be seen that the residualized system 
matches perfectly at steady-state. The singular values of the error system 
(G — G,), for each of the three approximations are shown in Figure 11.2 (a). 
The infinity norm of the error system is computed to be 0.295 for balanced 
residualization and occurs at 208 rad/s; the corresponding error norms for 
balanced truncation and optimal Hankel norm approximation are 0.324 and 
0.179 occurring at 169 rad/sec and 248 rad/sec respectively. The theoretical 
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Figure 11.1: Singular values o;(G) and o;(G.) for model reductions of Aero-engine 
from 15 to 6 states. 
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Figure 11.2: Singular values o;(G — G,) for scaled and unscaled error systems. 


upper bounds for these error norms are 0.635 (twice the sum of the tail) for 
residualization and truncation, and 0.187 (using (11.30)) for optimal Hankel 
norm approximation respectively. It should be noted that the plant under 
consideration is desired to have a closed-loop bandwidth of around 10 rad/sec. 
The error around this frequency, therefore, should be as small as possible 
for good controller design. Figure 11.2 (a) shows that the error for balanced 
residualization is the smallest in this frequency range. 

Steady-state gain preservation. It is sometimes desirable to have the 
steady-state gain of the reduced plant model the same as the full order model. 
For example, this is the case if we want to use the model for feedforward 
control. The truncated and optimal Hankel norm approximated systems do 
not preserve the steady-state gain and have to be scaled, i.e., the model 
approximation G, is replaced by GaW;, where W, = G.(0)~'G(0), G 
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being the full order model. The scaled system no longer enjoys the bounds 
guaranteed by these methods and ||G — G,Ws||.. can be quite large as is 
shown in Figure 11.2 (b). Note that the residualized system does not need 
scaling, and the error system for this case has been shown again only for 
ease of comparison. The infinity norms of these errors are computed and are 
found to degrade to 5.71 (at 151 rad/sec) for the scaled truncated system 
and 2.61 (at 168.5 rad/sec) for the scaled optimal Hankel norm approximated 
system. The truncated and Hankel norm approximated systems are clearly 
worse after scaling since the errors in the critical frequency range around 
become large despite the improvement at steady-state. Hence residualization 
is to be preferred over these other techniques whenever good low frequency 
matching is desired. 


50h —50F -50F 0-7 
meal 71007 Reduced: -- “100 Reduced: - - 
-150 Full order: — -150 Full order: —— -150 Full order: — 
0 0.01 0.02 0.03 0 0.01 0.02 0.03 0 0.01 0.02 0.03 
Time [sec] Time [sec] Time [sec] 
(a) Balanced residual- (b) Scaled balanced (c) Scaled optimal Han- 
ization. truncation. kel norm approxima- 


Figure 11.3: 


tion. 


Aero-engine: Impulse responses (2nd input). 


Reduced: - - Reduced: - - Nees Reduced: - - ae 
-15} Full order: — -15} Full order: — SS -15} Full order: — SS 
0 0.5, 1 0 0.5 1 0 0.5, 1 
Time [sec] Time [sec] Time [sec] 
(a) Balanced residual- (b) Scaled balanced (c) Scaled optimal Han- 
ization. truncation. kel norm approxima- 


tion. 


Figure 11.4: Aero-engine: Step responses (2nd input). 


Impulse and step responses from the second input to all the outputs for 
the three reduced systems (with the truncated and optimal Hankel norm 
approximated systems scaled) are shown in Figures 11.3 and 11.4 respectively. 
The responses for the other inputs were found to be similar. The simulations 
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confirm that residualized model’s response is closer to the full order model’s 
response. 


11.6.2. Reduction of an aero-engine controller 


We now consider reduction of a two degrees-of-freedom H,, loop-shaping 
controller. The plant for which the controller is designed is the full order 
gas turbine engine model described in example 11.6.1 above. 

A robust controller was designed using the procedure outlined in subsection 
9.4.3; see Figure 9.21 which describes the design problem. Trer(s) is the desired 
closed-loop transfer function, p is a design parameter, G, = M,1N, is the 
shaped plant and (Ay,, Ay.) are perturbations on the normalized coprime 
factors representing uncertainty. We denote the actual closed-loop transfer 
function (from 3 to y) by Tyg. 

The controller K = [Ky K2], which excludes the loop-shaping weight Wi 
(which includes 3 integral action states), has 6 inputs (because of the two 
degrees-of-freedom structure), 3 outputs, and 24 states. It has not been scaled 
(i.e. the steady-state value of T,g has not been matched to that of Trer by 
scaling the prefilter). It is reduced to 7 states in each of the cases that follow. 

Let us first compare the magnitude of Tyg with that of the specified model 
Trer. By magnitude, we mean singular values. These are shown in Figure 
11.5 (a). The infinity norm of the difference is computed to be 0.974 and 
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(a) Unscaled prefilter [K, Ke]. (b) Scaled prefilter [K,W; Ko]. 


Figure 11.5: Singular values of Trer and Ty. 


occurs at 8.5 rad/sec. Note that we have p = 1 and the ¥ achieved in the 
Hoo. optimization is 2.32, so that ||Tyg — Tret|loo < 7p? as required; see (9.81). 
The prefilter is now scaled so that Tyg matches T,.¢ exactly at steady-state, 
ie. we replace Ky by KiW; where W; = Tyg(0)~'Trer(0). It is argued by 
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Hoyle et al. (1991) that this scaling produces better model matching at all 
frequencies, because the H.. optimization process has already given Ty, the 
same magnitude frequency response shape as the model Tyep. The scaled 
transfer function is shown in Figure 11.5 (b), and the infinity norm of the 
difference (Tyg — Tres) computed to be 1.44 (at 46 rad/sec). It can be seen 
that this scaling has not degraded the infinity norm of the error significantly 
as was claimed by Hoyle et al. (1991). To ensure perfect steady-state tracking 
the controller is always scaled in this way. We are now in a position to discuss 
ways of reducing the controller. We will look at the following two approaches: 


1. The scaled controller [K,W; Ky] is reduced. A balanced residualization 
of this controller preserves the controller’s steady-state gain and would not 
need to be scaled again. Reductions via truncation and optimal Hankel 
norm approximation techniques, however, lose the steady-state gain. The 
prefilters of these reduced controllers would therefore need to be rescaled 
to match Tyer (0). 

2. The full order controller [Ky Ko] is directly reduced without first scaling 
the prefilter. In which case, scaling is done after reduction. 


We now consider the first approach. A balanced residualization of 
[KkiW; Ke] is obtained. The theoretical upper bound on the infinity norm 
of the error (twice the sum of the tail) is 0.698, i.e. 


|kiW:- GaWi)a Ke - Kaa |loo < 0.698 (11.32) 


where the subscript a refers to the low order approximation. The actual error 
norm is computed to be 0.365. Tyg for this residualization is computed and 
its magnitude plotted in Figure 11.6 (a). The infinity norm of the difference 
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Figure 11.6: Singular values of T;e: and Tyg for reduced [KiW; Ka]. 


(Ty — Trep) is computed to be 1.44 (at 43 rad/sec). This value is very close 
to that obtained with the full order controller [K,W; 2], and so the 
closed-loop response of the system with this reduced controller is expected 
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to be very close to that with the full order controller. Next [KiW; Ko] is 
reduced via balanced truncation. The bound given by (11.32) still holds. The 
steady-state gain, however, falls below the adjusted level, and the prefilter 
of the truncated controller is thus scaled. The bound given by (11.32) can 
no longer be guaranteed for the prefilter (it is in fact found to degrade to 
3.66), but it holds for Ky —Ko2q. Singular values of Trer and Tyg for the scaled 
truncated controller are shown in Figure 11.6 (b). The infinity norm of the 
difference is computed to be 1.44 and this maximum occurs at 46 rad/sec. 
Finally [kiW; Ke] is reduced by optimal Hankel norm approximation. The 
following error bound is theoretically guaranteed: 


|| AiW; —(KiWi)a Ke — Koa |loo < 0.189 (11.33) 


Again the reduced prefilter needs to be scaled and the above bound can no 
longer be guaranteed; it actually degrades to 1.87. Magnitude plots of Tyg 
and T,e¢ are shown in Figure 11.6 (c), and the infinity norm of the difference 
is computed to be 1.43 and occurs at 43 rad/sec. 

It has been observed that both balanced truncation and optimal Hankel 
norm approximation cause a lowering of the system steady-state gain. In the 
process of adjustment of these steady-state gains, the infinity norm error 
bounds are destroyed. In the case of our two degrees-of-freedom controller, 
where the prefilter has been optimized to give closed-loop responses within a 
tolerance of a chosen ideal model, large deviations may be incurred. Closed- 
loop responses for the three reduced controllers discussed above are shown in 
Figures 11.7, 11.8 and 11.9. 

It is seen that the residualized controller performs much closer to the full 
order controller and exhibits better performance in terms of interactions and 
overshoots. It may not be possible to use the other two reduced controllers 
if the deviation from the specified model becomes larger than the allowable 
tolerance, in which case the number of states by which the controller is reduced 
would probably have to be reduced. It should also be noted from (11.32) and 
(11.33) that the guaranteed bound for Ky — Ko, is lowest for optimal Hankel 
norm approximation. 

Let us now consider the second approach. The controller [K, Ke] 
obtained from the H,, optimization algorithm is reduced directly. The 
theoretical upper bound on the error for balanced residualization and 
truncation is 

|| ky — Kia Ke — Kaa |loo < 0.165 (11.34) 


The residualized controller retains the steady-state gain of [K, Ko]. It is 
therefore scaled with the same W; as was required for scaling the prefilter 
of the full order controller. Singular values of T,.¢ and Tyg for this reduced 
controller are shown in Figure 11.10 (a), and the infinity norm of the difference 
was computed to be 1.50 at 44 rad/sec. [Ky 2] is next truncated. The 
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Figure 11.8: Closed-loop step responses: [kK,W; K2] balanced truncated. 
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Figure 11.10: Singular values of T;-r and Tyg for reduced [K, Ka]. 


steady-state gain of the truncated controller is lower than that of [K, Ko], 
and it turns out that this has the effect of reducing the steady-state gain of 
Tyg. Note that the steady-state gain of Tyg is already less than that of Ther 
(Figure 11.5). Thus in scaling the prefilter of the truncated controller, the 
steady-state gain has to be pulled up from a lower level as compared with 
the previous (residualized) case. This causes greater degradation at other 
frequencies. The infinity norm of (Tyg — Trer) in this case is computed to 
be 25.3 and occurs at 3.4 rad/sec (see Figure 11.10 (b)). Finally [K, Ko] is 
reduced by optimal Hankel norm approximation. The theoretical bound given 
in (11.30) is computed and found to be 0.037, i.e. we have 


(KiShis Kea Koalhs 0037 (11.35) 


The steady-state gain falls once more in the reduction process, and again a 
larger scaling is required. Singular value plots for Tyg and Tyrer are shown in 
Figure 11.10 (c). |/Zya — Tret|loo is computed to be 4.50 and occurs at 5.1 
rad/sec. 

Some closed-loop step response simulations are shown in Figures 11.11, 
11.12 and 11.13. It can be seen that the truncated and Hankel norm 
approximated systems have deteriorated to an unacceptable level. Only the 
residualized system maintains an acceptable level of performance. 

We have seen that the first approach yields better model matching, though 
at the expense of a larger infinity norm bound on Ky — K2_ (compare (11.32) 
and (11.34), or (11.33) and (11.35)). We have also seen how the scaling of the 
prefilter in the first approach gives poorer performance for the truncated and 
optimal Hankel norm approximated controllers, relative to the residualized 
one. 

In the second case, all the reduced controllers need to be scaled, but 
a “larger” scaling is required for the truncated and optimal Hankel norm 
approximated controllers. There appears to be no formal proof of this 
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observation. It is, however, intuitive in the sense that controllers reduced 
by these two methods yield poorer model matching at steady-state as 
compared with that achieved by the full order controller. A larger scaling 
is therefore required for them than is required by the full order or residualized 
controllers. In any case, this larger scaling gives poorer model matching at 
other frequencies, and only the residualized controller’s performance is deemed 
acceptable. 


11.7 Reduction of unstable models 


Balanced truncation, balanced residualization and optimal Hankel norm 
approximation only apply to stable models. In this section we will briefly 
present two approaches for reducing the order of an unstable model. 


11.7.1 Stable part model reduction 


Enns (1984) and Glover (1984) proposed that the unstable model could first 
be decomposed into its stable and anti-stable parts. Namely 


G(s) = G(s) + G(s) (11.36) 


where G(s) has all its poles in the closed right-half plane and G(s) has all its 
poles in the open left-half plane. Balanced truncation, balanced residualization 
or optimal Hankel norm approximation can then be applied to the stable part 
G,(s) to find a reduced order approximation G,,(s). This is then added to 
the anti-stable part to give 


Ga(s) = Guls) + Gea(s) (11.37) 


as an approximation to the full order model G(s). 


11.7.2 Coprime factor model reduction 


The coprime factors of a transfer function G(s) are stable, and therefore we 
could reduce the order of these factors using balanced truncation, balanced 
residualization or optimal Hankel norm approximation, as proposed in the 
following scheme (McFarlane and Glover, 1990). 


e Let G(s) = M~1(s)N(s), where M(s) and N(s) are stable left-coprime 
factors of G(s). 

e Approximate [N M] of degree n by [Na M,] of degree k < n, using 
balanced truncation, balanced residualization or optimal Hankel norm 
approximation. 
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e Realize the reduced order transfer function G,(s), of degree k, by Ga(s) = 
Mz1Na. 


A dual procedure could be written down based on a right coprime 
factorization of G(s). 

For related work in this area, we refer the reader to (Anderson and 
Liu, 1989; Meyer, 1987). In particular, Meyer (1987) has derived the following 
result: 


Theorem 11.3 Let (N,M) be a normalized left-coprime factorization of G(s) 
of degree n. Let [Na Ma] be a degree k balanced truncation of [N M] which 


has Hankel singular values 01 > 02 >... > On, > Ongi > ++: > On > 0. 
Then (Na,Ma) is a normalized left-coprime factorization of Ga = M;1Na, 
and |N, M,] has Hankel singular values o1,02,...,0.. 


Exercise 11.3 Is Theorem 11.8 true, if we replace balanced truncation by balanced 
residualization? 


11.8 Model reduction using MATLAB 


The commands in Table 11.2 from the MATLAB w-toolbox may be used 
to perform model reduction for stable systems. For an unstable system the 
commands in Table 11.3 may be used. 


Table 11.2: MATLAB commands for model reduction of stable system 


¥, Uses the Mu-toolbox 


sysd=strans (sys); % order states in Jordan form according to speed 
syst=strunc(sys,k); % then: truncate leaving k states in syst. 
sysr=sresid(sys,k); % or: residualize leaving k states in sysr. 

y 

h 

[sysb, hsig]=sysbal (sys) ; ¥, obtain balanced realization. 
sysbt=strunc(sysb,k); % then: balanced truncation leaving k states. 
sysbr=sresid(sysb,k); %, or: balanced residualization. 

sysh=hankmr (sysb,hsig,k,’d’); % or: optimal Hankel norm approximation 


Alternatively, the command [ar,br,cr,dr]=ohklmr(a,b,c,d,1,k) in the 
MATLAB Robust Control Toolbox finds directly the optimal Hankel norm 
approximation of an unstable plant based on first decomposing the system 
into the stable and unstable parts. It avoids the sometimes numerically ill- 
conditioned step of first finding a balanced realization. 
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Table 11.3: MATLAB commands for model reduction of unstable system 
% Uses the Mu-toolbox 


[syss,sysu]=sdecomp (sys) ; % decompose into stable and unstable part. 
sysi=sresid(sysbal(syss) ,ks) ; % balanced residualization of stable part. 
sysi=hankmr (sysbal(syss),ks,’d’); % or: Hankel norm approx. of stable part. 
syssbr=madd(sys1,sysu) ; % vealize reduced-order system. 

y 

h 

[nlcf,hsig,nrcf]=sncfbal (sys); % balanced realization of coprime factors. 
nrcfr=sresid(nrcf,k); % vesidualization of coprime factors. 
syscbr=cf2sys(nrcfr); % vealize reduced-order system. 


11.9 Conclusions 


We have presented and compared three main methods for model reduction 
based on balanced realizations: balanced truncation, balanced residualization 
and optimal Hankel norm approximation. 

Residualization, unlike truncation and optimal Hankel norm approximation, 
preserves the steady-state gain of the system, and, like truncation, it is simple 
and computationally inexpensive. It is observed that truncation and optimal 
Hankel norm approximation perform better at high frequencies, whereas 
residualization performs better at low and medium frequencies, i.e., up to 
the critical frequencies. Thus for plant model reduction, where models are not 
accurate at high frequencies to start with, residualization would seem to be 
a better option. Further, if the steady-state gains are to be kept unchanged, 
truncated and optimal Hankel norm approximated systems require scaling, 
which may result in large errors. In such a case, too, residualization would be 
a preferred choice. 

Frequency weighted model reduction has been the subject of numerous 
papers over the past few years. The idea is to emphasize frequency ranges 
where better matching is required. This, however, has been observed to have 
the effect of producing larger errors (greater mismatching) at other frequencies 
(Anderson, 1986; Enns, 1984). In order to get good steady-state matching, a 
relatively large weight would have to be used at steady-state, which would 
cause poorer matching elsewhere. The choice of weights is not straightforward, 
and an error bound is available only for weighted Hankel norm approximation. 
The computation of the bound is also not as easy as in the unweighted case 
(Anderson and Liu, 1989). Balanced residualization can in this context, be 
seen as a reduction scheme with implicit low and medium frequency weighting. 

For controller reduction, we have shown in a two degrees-of-freedom 
example, the importance of scaling and steady-state gain matching. Two 
approaches were considered. In the first approach, the prefilter of the full order 
controller was scaled beforehand. In this case, the prefilter of the residualized 
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system does not need to be scaled, and enjoys the guaranteed infinity norm 
error bounds. The prefilters of the other reduced systems had to be scaled, and 
this introduced large errors (with respect to the scaled full order prefilter). 
They were thus seen to give poorer performance. In the second approach, the 
unscaled full order controller [Ky K2] was first reduced, and then scaled. 
The residualized system needed the same scaling matrix as the full order 
controller. With the other reduced controllers, the model matching at steady- 
state deteriorated compared to the full order controller, and hence a larger 
scaling was needed. This caused very poor matching at other frequencies. The 
residualized controller thus performed better than the other two. As for the 
feedback part of the controller (2), the error bound given by optimal Hankel 
norm approximation was the best. 

It is remarked that, in general, steady-state gain matching may not be 
crucial. The matching should, however be good near the desired closed- 
loop bandwidth and for feedback controller design we need a good match 
around the gain crossover frequency. Balanced residualization has been seen 
to perform very close to the full order system in this frequency range. 
Good approximation at high frequencies may also sometimes be desired. In 
such a case, using truncation or optimal Hankel norm approximation with 
appropriate frequency weightings may yield better results. 


500 


MULTIVARIABLE FEEDBACK CONTROL 


12 


CASE STUDIES 


In this chapter, we present three case studies which illustrate a number of 
important practical issues, namely: weights selection in Ho. mixed-sensitivity design, 
disturbance rejection, output selection, two degrees-of-freedom Hoo loop-shaping 
design, ill-conditioned plants, 1 analysis and yp synthesis. 


12.1 Introduction 


The complete design process for an industrial control system will normally 
include the following steps: 


1. Plant modelling: to determine a mathematical model of the plant either 
from experimental data using identification techniques, or from physical 
equations describing the plant dynamics, or a combination of these. 

2. Plant input-output controllability analysis: to discover what closed-loop 
performance can be expected and what inherent limitations there are to 
‘good’ control, and to assist in deciding upon an initial control structure 
and may be an initial selection of performance weights. 

3. Control structure design: to decide on which variables to be manipulated 
and measured and which links should be mode between them. 

4. Controller design: to formulate a mathematical design problem which 
captures the engineering design problem and to synthesize a corresponding 
controller. 

5. Control system analysis: to assess the control system by analysis 
and simulation against the performance specifications or the designer’s 
expectations. 

6. Controller implementation: to implement the controller, almost certainly 
in software for computer control, taking care to address important issues 
such as anti-windup and bumpless transfer. 

7. Control system commissioning: to bring the controller on-line, to carry 
out on-site testing and to implement any required modifications before 
certifying that the controlled plant is fully operational. 
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In this book we have focused on steps 2, 3, 4 and 5, and in this chapter we will 
present three case studies which demonstrate many of the ideas and practical 
techniques which can be used in these steps. The case studies are not meant 
to produce the ‘best’ controller for the application considered but rather are 
used here to illustrate a particular technique from the book. 

In case study 1, a helicopter control law is designed for the rejection of 
atmospheric turbulence. The gust disturbance is modelled as an extra input 
to an S/KS Hoo mixed-sensitivity design problem. Results from nonlinear 
simulations indicate significant improvement over a standard $/K'S design. 

Case study 2 illustrates the application and usefulness of the two degrees- 
of-degree H.. loop-shaping approach by applying it to the design of a robust 
controller for a high performance areo-engine. Nonlinear simulation results are 
shown. Efficient and effective tools for control structure design (input-output 
selection) are also described and applied to this problem. 

The final case study is concerned with the control of an idealized distillation 
column. A very simple plant model is used, but it is sufficient to illustrate the 
difficulties of controlling ill-conditioned plants and the adverse effects of model 
uncertainty. The structured singular value pz is seen to be a powerful tool for 
robustness analysis. 

Case studies 1, 2 and 3 are based on papers by Postlethwaite, Foster and 
Walker (1994), Samar and Postlethwaite (1994), and Skogestad et al. (1988), 
respectively. 


12.2 Helicopter control 


This case study is used to illustrate how weights can be selected in H.. mixed- 
sensitivity design, and how this design problem can be modified to improve 
disturbance rejection properties. 


12.2.1 Problem description 


In this case study, we consider the design of a controller to reduce the 
effects of atmospheric turbulence on helicopters. The reduction of the effects 
of gusts is very important in reducing a pilot’s workload, and enables 
aggressive manoeuvers to be carried out in poor weather conditions. Also, 
as a consequence of decreased buffeting, the airframe and component lives are 
lengthened and passenger comfort is increased. 

The design of rotorcraft flight control systems, for robust stability and 
performance, has been studied over a number of years using a variety 
of methods including: H. optimization (Yue and Postlethwaite, 1990; 
Postlethwaite and Walker, 1992); eigenstructure assignment (Manness and 
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Murray-Smith, 1992; Samblancatt, Apkarian and Patton, 1990); sliding mode 
control (Foster, Spurgeon and Postlethwaite, 1993); and H» design (Takahashi, 
1993). The H.,. controller designs have been particularly successful (Walker, 
Postlethwaite, Howitt and Foster, 1993), and have proved themselves in 
piloted simulations. These designs have used frequency information about the 
disturbances to limit the system sensitivity but in general there has been no 
explicit consideration of the effects of atmospheric turbulence. Therefore by 
incorporating practical knowledge about the disturbance characteristics, and 
how they affect the real helicopter, improvements to the overall performance 
should be possible. We will demonstrate this below. 

The nonlinear helicopter model we will use for simulation purposes was 
developed at the Defence Research Agency (DRA), Bedford (Padfield, 1981) 
and is known as the Rationalized Helicopter Model (RHM). A turbulence 
generator module has recently been included in the RHM and this enables 
controller designs to be tested on-line for their disturbance rejection 
properties. It should be noted that the model of the gusts affects the helicopter 
equations in a complicated fashion and is self contained in the code of the 
RHM. For design purposes we will imagine that the gusts affect the model in 
a much simpler manner. 

We will begin by repeating the design of Yue and Postlethwaite (1990) which 
used an S/K.S H.. mixed sensitivity problem formulation without explicitly 
considering atmospheric turbulence. We will then, for the purposes of design, 
represent gusts as a perturbation in the velocity states of the helicopter 
model and include this disturbance as an extra input to the S/KS design 
problem. The resulting controller is seen to be substantially better at rejecting 
atmospheric turbulence than the earlier standard S/KS design. 


12.2.2 The helicopter model 


The aircraft model used in our work is representative of the Westland Lynx, a 
twin-engined multi-purpose military helicopter, approximately 9000 lbs gross 
weight, with a four-blade semi-rigid main rotor. The unaugmented aircraft is 
unstable, and exhibits many of the cross-couplings characteristic of a single 
main-rotor helicopter. In addition to the basic rigid body, engine and actuator 
components, the model also includes second order rotor flapping and coning 
modes for off-line use. The model has the advantage that essentially the same 
code can be used for a real-time piloted simulation as for a workstation-based 
off-line handling qualities assessment. 

The equations governing the motion of the helicopter are complex and 
difficult to formulate with high levels of precision. For example, the rotor 
dynamics are particularly difficult to model. A robust design methodology is 
therefore essential for high performance helicopter control. The starting point 
for this study was to obtain an eighth-order differential equation modelling the 
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State Description 
(2) Pitch Attitude 
® Roll Attitude 
p Roll Rate 
qd Pitch Rate 
r Yaw Rate 
Ux Forward Velocity 
Vy Lateral Velocity 
Vz Vertical Velocity 


Table 12.1: Helicopter state vector 


small-perturbation rigid motion of the aircraft about hover. The corresponding 
state-space model is 


& = Ax+Bu (12.1) 
y = Cx (12.2) 


where the matrices A, B and C for the appropriately scaled system are avail- 
able over the Internet as described in the preface. The eight state rigid body 
vector x is given in the Table 12.1. The outputs consist of four controlled 
outputs 


Heave velocity H 
Pitch attitude 0 
Roll attitude ¢ 
Heading rate a 


together with two additional measurements 


e Body axis pitch p 
e Roll reate q 


One degree-of-freedom controllers as shown in Figure 12.1 are to be 
designed. Notice that in the standard one degree-of-freedom configuration 
the pilot reference commands r; are augmented by a zero vector because of 
the rate feedback signals . These zeros indicate that there are no a priori 
performance specifications on yo =[p_ q]. 

We are interested in the design of full-authority controllers, which means 
that the controller has total control over the blade angles of the main and 
tail rotors, and is interposed between the pilot and the actuation system. It 
is normal in conventional helicopters for the controller to have only limited 
authority leaving the pilot to close the loop for much of the time. 
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Figure 12.1: Helicopter control structure (a) as implemented, (b) in the standard 
one degree-of-freedom configuration 


The controller generates four blade angle demands which are effectively the 
helicopter inputs, since the actuators (which are typically modelled as first 
order lags) are modelled as unity gains in this study. The blade angles are 


e main rotor collective 
e longitudinal cyclic 

e lateral cyclic 

e tail rotor collective 


The action of each of these blade angles can be briefly described as follows. 
The main rotor collective changes all the blades of the main rotor by an equal 
amount and so roughly speaking controls lift. The longitudinal and lateral 
cyclic inputs change the main rotor blade angles differently thereby tilting 
the lift vector to give longitudinal and lateral motion, respectively. The tail 
rotor is used to balance the torque generated by the main rotor, and so stops 
the helicopter spinning around; it is also used to give lateral motion. This 
description which assumes the helicopter inputs and outputs are decoupled 
is useful to get a feeling of how a helicopter works but the dynamics are 
actually highly coupled. They are also unstable, and about some operating 
points exhibit non-minimum phase characteristics. 
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12.2.3. H,. mixed-sensitivity design 


We will consider the H.. mixed-sensitivity design problem illustrated in 
Figure 12.2. It can be viewed as a tracking problem as previously discussed in 


Figure 12.2: S/KS mixed-sensitivity minimization 


Chapter 9 (see Figure 9.11), but with an additional weight W3. W; and W2 are 
selected as loop-shaping weights whereas W3 is signal-based. The optimization 
problem is to find a stabilizing controller K to minimize the cost function 


W,SWa 
[| wisi. I ns) 


This cost was also considered by Yue and Postlethwaite (1990) in the context 
of helicopter control. Their controller was successfully tested on a piloted flight 
simulator at DRA Bedford and so we propose to use the same weights here. 
The design weights W1, W2 and W3 were selected as 


s+12 s+2.81 s$+2.81 
= di 3 : . 
wi as {0 95400127 995+ 0.0087 98°95 40.008" 
s+10 2s 2s 
3) 12.4 
s+0.01’ (s+ 4)(s + 4.5)’ aaeris} ee 
s + 0.0001 
= .j—@___ I 12. 
We. 0.5 iG 4 (12.5) 
W3 = diag{1, 1, 1, 1, 0.1, 0.1} (12.6) 


The reasoning behind these selections of Yue and Postlethwaite (1990) is 
summarized below. 

Selection of Wi(s): For good tracking accuracy in each of the controlled 
outputs the sensitivity function is required to be small. This suggests forcing 
integral action into the controller by selecting an s~! shape in the weights 
associated with the controlled outputs. It was not thought necessary to have 
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exactly zero steady-state errors and therefore these weights were given a finite 
gain of 500 at low frequencies. (Notice that a pure integrator cannot be 
included in W, anyway, since the standard H.,. optimal control problem would 
not then be well posed in the sense that the corresponding generalized plant 
P could not then be stabilized by the feedback controller kK.) In tuning W, it 
was found that a finite attenuation at high frequencies was useful in reducing 
overshoot. Therefore, high-gain low-pass filters were used in the primary 
channels to give accurate tracking up to about 6 rad/s. The presence of 
unmodelled rotor dynamics around 10 rad/s limits the bandwidth of W. With 
four inputs to the helicopter, we can only expect to independently control four 
outputs. Because of the rate feedback measurements the sensitivity function 
S is a six by six matrix and therefore two of its singular values (corresponding 
to p and q) are always close to one across all frequencies. All that can be done 
in these channels is to improve the disturbance rejection properties around 
crossover, 4 to 7 rad/s, and this was achieved using second-order band-pass 
filters in the rate channels of W,. 

Selection of W2(s): The same first-order high-pass filter is used in each 
channel with a corner frequency of 10 rad/s to limit input magnitudes at high 
frequencies and thereby limit the closed-loop bandwidth. The high frequency 
gain of W. can be increased to limit fast actuator movement. The low 
frequency gain of W 2 was set to approximately —100 dB to ensure that the 
cost function is dominated by W, at low frequencies. 

Selection of W3(s): W3 is a weighting on the reference input r. It is chosen 
to be aconstant matrix with unity weighting on each of the output commands 
and a weighting of 0.1 on the fictitious rate demands. The reduced weighting 
on the rates (which are not directly controlled) enables some disturbance 
rejection on these outputs, without them significantly affecting the cost 
function. The main aim of W3 is to force equally good tracking of each of 
the primary signals. 

For the controller designed using the above weights, the singular value 
plots of S and KS are shown in Figures 12.3 (a) and 12.3 (b). These have 
the general shapes and bandwidths designed for and, as already mentioned, 
the controlled system performed well in piloted simulation. The effects of 
atmospheric turbulence will be illustrated later after designing a second 
controller in which disturbance rejection is explicitly included in the design 
problem. 


12.2.4 Disturbance rejection design 


In the design below we will assume that the atmospheric turbulence can be 
modelled as gust velocity components that perturb the helicopter’s velocity 
states vz, v, andv, byd=[d, dy ds ie as in the following equations. The 
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Figure 12.3: Singular values of S and KS (S/KS design) 


disturbed system is therefore expressed as 


£ 


Ar +A ; +Bu (12.7) 
y = Cz. (12.8) 


Define Ba S columns 6,7 and 8 of A. Then we have 


Az + Bu+ Bad (12.9) 
Cx (12.10) 


which in transfer function terms can be expressed as 
y = G(s)u+ Ga(s)d (12.11) 


where G(s) = C(sI — A)~!B, and Ga(s) = C(sI — A)~'Ba. The design 
problem we will solve is illustrated in Figure 12.4. The optimization problem 
is to find a stabilizing controller K that minimizes the cost function 


| WiSW; —WiSGaW, | (12.12) 


W2KSW3 —W2KSGaW,4 


‘ 
i 
cast into the general control configuration and solved using standard software. 
Notice that if we set W4 to zero the problem reverts to the S/KS mixed- 
sensitivity design of the previous subsection. To synthesize the controller we 
used the same weights W,, W2 and W3 as in the S/KS design, and selected 
W, = al, with a a scalar parameter used to emphasize disturbance rejection. 


which is the H.. norm of the transfer function from to z. This is easily 
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Figure 12.4: Disturbance rejection design 


After a few iterations we finalized on a = 30. For this value of a, the singular 
value plots of S and KS, see Figures 12.5 (a) and 12.5 (b), are quite similar 
to those of the S/S design, but as we will see in the next subsection there 
is a significant improvement in the rejection of gusts. Also, since Gq shares 
the same dynamics as G, and W, is a constant matrix, the degree of the 
disturbance rejection controller is the same as that for the S/K'S design. 


10-3 10-2 10-1 109 10! 102 108 1076-3 qo-2 10-1 109 10! 102 108 
Frequency [rad/s] Frequency [rad/s] 
(a) S$ (b) KS 


Figure 12.5: Singular values of S and KS (disturbance rejection design) 
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12.2.5 Comparison of disturbance rejection properties 
of the two designs 


To compare the disturbance rejection properties of the two designs we 
simulated both controllers on the RHM nonlinear helicopter model equipped 
with a statistical discrete gust model for atmospheric turbulence, (Dahl and 
Faulkner, 1979). With this simulation facility, gusts cannot be generated at 
hover and so the nonlinear model was trimmed at a forward flight speed 
of 20 knots (at an altitude of 100 ft), and the effect of turbulence on the 
four controlled outputs observed. Recall that both designs were based on 
a linearized model about hover and therefore these tests at 20 knots also 
demonstrate the robustness of the controllers. Tests were carried out for 
a variety of gusts, and in all cases the disturbance rejection design was 
significantly better than the S/K'S design. 

In Figure 12.6, we show a typical gust generated by the RHM. The effects 
of this on the controlled outputs are shown in Figures 12.7 and 12.8 for the 
S/KS design and the disturbance rejection design, respectively. Compared 
with the S/KS design, the disturbance rejection controller practically halves 
the turbulence effect on heavy velocity, pitch attitude and roll attitude. The 
change in the effect on heading rate is small. 
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Figure 12.6: Velocity components of turbulence (time in seconds) 


CASE STUDIES 511 
Heave Velocity Pitch Attitude 
2 
0.1 
1 ee [ i 
eh i | | | 3 
| 
BO Po lea I & 0 \ : 
| | \al | / | \ 
1 \ : \/ ) \ 
0.4 
% 5 10 15 0 5 10 15 
Roll Attitude Heading Rate 
0.2 
0.2 : 
0 
n | kg 
Z | oe ’ —J 
Zo 3 vA ce 
s 
| 0.4 
-0.2 
0 5 10 15 0.25 5 10 15 


Figure 12.7: Response to turbulence of the S/KS design (time in seconds) 
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Figure 12.8: Response to turbulence of the disturbance rejection design (time in 
seconds) 
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12.2.6 Conclusions 


The two controllers designed were of the same degree and had similar 
frequency domain properties. But by incorporating knowledge about 
turbulence activity into the second design, substantial improvements in 
disturbance rejection were achieved. The reduction of the turbulence effects 
by a half in heave velocity, pitch attitude and roll attitude indicates the 
possibility of a significant reduction in a pilot’s workload, allowing more 
aggressive manoeuvers to be carried out with greater precision. Passenger 
comfort and safety would also be increased. 

The study was primarily meant to illustrate the ease with which information 
about disturbances can be beneficially included in controller design. The case 
study also demonstrated the selection of weights in H.. mixed-sensitivity 
design. 


12.3 Aero-engine control 


In this case study, we apply a variety of tools to the problem of output 
selection, and illustrate the application of the two degrees-of-freedom Glover- 
McFarlane H,,. loop-shaping design procedure. 


12.3.1 Problem description 


This case study explores the application of advanced control techniques to the 
problem of control structure design and robust multivariable controller design 
for a high performance gas turbine engine. The engine under consideration 
is the Spey engine which is a Rolls Royce 2-spool reheated turbofan, used 
to power modern military aircraft. The engine has two compressors: a low 
pressure (LP) compressor or fan, and a high pressure (HP) or core compressor 
as shown in Figure 12.9. The high pressure flow at the exit of the core 
compressor is combusted and allowed to partially expand through the HP 
and LP turbines which drive the two compressors. The flow finally expands 
to atmospheric pressure at the nozzle exit, thus producing thrust for aircraft 
propulsion. The efficiency of the engine and the thrust produced depends on 
the pressure ratios generated by the two compressors. If the pressure ratio 
across a compressor exceeds a certain maximum, it may no longer be able to 
hold the pressure head generated and the flow will tend to reverse its direction. 
This happens in practice, with the flow actually going negative, but it is only 
a momentary effect. When the back pressure has cleared itself, positive flow 
is re-established but, if flow conditions do not change, the pressure builds 
up causing flow reversal again. Thus the flow surges back and forth at high 
frequency, the phenomenon being referred to as surge. Surging causes excessive 
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Figure 12.9: Schematic of the aero-engine 


aerodynamic pulsations which are transmitted through the whole machine and 
must be avoided at all costs. However, for higher performance and greater 
efficiency the compressors must also be operated close to their surge lines. 
The primary aim of the control system is thus to control engine thrust whilst 
regulating compressor surge margins. But these engine parameters, namely 
thrust and the two compressor surge margins, are not directly measurable. 
There are, however, a number of measurements available which represent these 
quantities, and our first task is to choose from the available measurements, the 
ones that are in some sense better for control purposes. This is the problem 
of output selection as discussed in Chapter 10. 

The next step is the design of a robust multivariable controller which 
provides satisfactory performance over the entire operating range of the 
engine. Since the aero-engine is a highly nonlinear system, it is normal for 
several controllers to be designed at different operating points and then to 
be scheduled across the flight envelope. Also in an aero-engine there are 
a number of parameters, apart from the ones being primarily controlled, 
that are to be kept within specified safety limits, e.g. the turbine blade 
temperature. The number of parameters to be controlled and/or limited 
exceeds the number of available inputs, and hence all these parameters cannot 
be controlled independently at the same time. The problem can be tackled 
by designing a number of scheduled controllers, each for a different set of 
output variables, which are then switched between, depending on the most 
significant limit at any given time. The switching is usually done by means of 
lowest-wins or highest-wins gates, which serve to propagate the output of the 
most suitable controller to the plant input. Thus, a switched gain-scheduled 
controller can be designed to cover the full operating range and all possible 
configurations. In Postlethwaite et al. (1995) a digital multi-mode scheduled 
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controller is designed for the Spey engine under consideration here. In their 
study gain-scheduling was not required to meet the design specifications. 
Below we will describe the design of a robust controller for the primary engine 
outputs using the two degrees-of-freedom H. loop-shaping approach. The 
same methodology was used in the design of Postlethwaite et al. (1995) which 
was successfully implemented and tested on the Spey engine. 


12.3.2 Control structure design: output selection 


The Spey engine has three inputs, namely fuel flow (WFE), a nozzle with a 
variable area (AJ), and inlet guide vanes with a variable angle setting (IGV): 


u=[WFE AJ IGv]" 
In this study, there are six output measurements available, 
yar =[NL OPR1 OPR2 LPPR LPEMN NH]" 


as described below. with three inputs we can independently control only three 
For each one of the six output measurements, a look-up table provides its 
desired optimal value (set point) as a function of the operating point. However, 
with three inputs we can only control three outputs independently so the first 
question we face is, which three? 

Engine thrust (one of the parameters to be controlled) can be defined in 
terms of the LP compressor’s spool speed (NL), the ratio of HP compressor’s 
outlet pressure to engine inlet pressure (OPR1), or the engine overall pressure 
ratio (OPR2). We will choose from these three measurements the one that is 
best for control, 


e Engine thrust: Select one of NL, QPR1 and QPR2 (outputs 1, 2 and 3). 


Similarly surge margin of the LP compressor can be represented by the LP 
compressor’s pressure ratio (LPPR), or the LP compressor’s exit much number 
measurement (LPEMN), and a selection between the two has to be made. 


e Surge margin: Select one of LPPR and LPEMN (outputs 4 and 5). 


In this study we will not consider control of the HP compressor’s surge margin, 
or other configurations concerned with the limiting of engine temperatures. 
Our third output will be the HP compressor’s spool speed (NH), which it is 
also important to maintain within safe limits. (NH is actually the HP spool 
speed made dimensionless by dividing by the square root of the total inlet 
temperature and scaled so that it is a percentage of the maximum spool 
speed at a standard temperature of 288.15°K). 


e Spool speed: Select NH (output 6). 
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We have now subdivided the available outputs into three subsets, and decided 
to select one output from each subset. This gives rise to the six candidate 
output sets as listed in Table 12.2. 

We now apply some of the tools given in Chapter 10 for tackling the 
output selection problem. It is emphasized at this point that a good physical 
understanding of the plant is very important in the context of this problem, 
and some measurements may have to be screened beforehand on practical 
grounds. A 15-state linear model of the engine (derived from a non-linear 
simulation at 87% of maximum thrust) will be used in the analysis that 
follows. The model is available over the Internet (as described in the preface), 
along with actuator dynamics which result in a plant model of 18 states for 
controller design. The nonlinear model used in this case study was provided 
by the UK Defence Research Agency at Pyestock. 

Scaling. Some of the tools we will use for control structure selection are 
dependent on the scalings employed. Scaling the inputs and the candidate 
measurements therefore, is vital before comparisons are made and can also 
improve the conditioning of the problem for design purposes. We use the 
method of scaling described in Section 9.4.2. The outputs are scaled such 
that equal magnitudes of cross-coupling into each of the outputs are equally 
undesirable. We have chosen to scale the thrust-related outputs such that one 
unit of each scaled measurement represents 7.5% of maximum thrust. A step 
demand on each of these scaled outputs would thus correspond to a demand 
of 7.5% (of maximum) in thrust. The surge margin-related outputs are scaled 
so that one unit corresponds to 5% surge margin. If the controller designed 
provides an interaction of less than 10% between the scaled outputs (for unit 
reference steps), then we would have 0.75% or less change in thrust for a step 
demand of 5% in surge margin, and a 0.5% or less change in surge margin 
for a 7.5% step demand in thrust. The final output NH (which is already a 
scaled variable) was further scaled (divided by 2.2) so that a unit change in 
NH corresponds to a 2.2% change in NH. The inputs are scaled by 10% of 
their expected ranges of operation. 

Steady-state model. With these scalings the steady-state model yan = 
Gauu (with all the candidate outputs included) and the corresponding RGA- 
matrix is given by 


0.696 —0.046 —0.001 0.009 0.016 0.000 
1.076 —0.027 0.004 0.016 0.008 —0.000 
1.385 0.087 —0.002 0.006 0.028 —0.000 
Gan(0) = 11.036 0.238 —0.017 A(Gan(0)) = 0.971 —0.001 0.002 
0.064 —0.412 0.000 —0.003 0.950 0.000 
1.474 —0.093 0.983 0.002 —0.000 0.998 
(12.13) 
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Candidate RHP zeros 
Set No. controlled < 100 rad/sec | o (G(0)) 
outputs 
1 NL, LPPR, NH (1, 4, 6) none 0.060 
2 OPR1, LPPR, NH (2, 4, 6) none 0.049 
3 OPR2, LPPR, NH (3, 4, 6) 30.9 0.056 
4 NL, LPEMN, NH (1, 5, 6) none 0.366 
5 OPR1, LPEMN, NH (2, 5, 6) none 0.409 
6 OPR2, LPEMN, NH (3, 5, 6) 27.7 0.392 


Table 12.2: RHP zeros and minimum singular value for the six candidate output 
sets 


and the singular value decomposition of Gan(0) = UpXoV¢" is 


0.062 0.001 0.144 0.944 0.117 0.266 
0.095 0.001 —0.118 —0.070 —0.734 0.659 
Uo - 0.123 —0.025 0.133 —0.286 0.640 0.689 
0.977 —0.129 —0.011 0.103 —0.001 —0.133 
—0.006 0.065  —0.971 0.108 0.195 0.055 
0.131 0.989 0.066  —0.000 0.004 —0.004 


11.296 0 0 
4 te eae 1.000 —0.007 —0.021 

=|] ° 2 Vo = |0.020 —0.154 0.988 
4 4 A 0.010 0.988 0.154 
0 0 0 


ya 
The RGA-matrix of the overall non-square gain matrix, A = Gan x eu is 
sometimes a useful screening when there are many alternatives. The six row- 
sums of the RGA-matrix are 


Ay> = [0.025 0.023 0.034 0.972 0.947 1.000)" 


and from (10.16) this indicates that we should select outputs 4, 5 and 
6 (corresponding to the three largest elements) in order to maximize the 
projection of the selected outputs onto the space corresponding to the three 
nonzero singular values. However, this is not one of our six candidate output 
sets because there is no output directly related to engine thrust (outputs 1, 2 
and 3). 

We now proceed with a more detailed input-output controllability analysis 
of the six candidate output sets. In the following G(s) refers to the transfer 
function matrix for the effect of the three inputs on the selected three outputs. 

Minimum singular value. In Chapter 10, we showed that a reasonable 
criterion for selecting controlled outputs y is to make ||G~!(y — yopt) || small, 
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in particular at steady-state. Here y — Yopt is the deviation in y from its 
optimal value. At steady-state this deviation arises mainly from errors in 
the (look-up table) set point due to disturbances and unknown variations in 
the operating point. If we assume that, with the scalings given above, the 
magnitude |(y — Yopt)a| is similar (close to 1) for each of the six outputs, then 
we should select a set of outputs such that the elements in G~1+(0) are small, 
or alternatively, such that a (G(0)) is as large as possible. In Table 12.2 we 
have listed a (G(0)) for the six candidate output sets. We conclude that we 
can eliminate sets 1,2 and 3 consider only sets 4,5 and 6. For these three sets 
we find that the value of a(G(0)) is between 0.366 and 0.409 which is only 
slightly smaller than (Gy (0)) = 0.417. 


Remark. The three eliminated sets all include output 4, LPPR. Interestingly, this 
output is associated with the largest element in the gain matrix Gau(0) of 11.0, and 
is thus also associated with the largest singular value (as seen from the first column 


of U). 


Right-half plane zeros. Right-half plane (RHP) zeros limit the achievable 
performance of a feedback loop, by limiting the open-loop gain-bandwidth 
product. They can be a cause of concern, particularly, if they lie within 
the closed-loop bandwidth one is aiming for. Choosing different outputs for 
feedback control can give rise to different numbers of RHP zeros at differing 
locations. The choice of outputs should be such that a minimum number of 
RHP zeros are encountered, and should be as far removed from the imaginary 
axis as possible. 

Table 12.2 shows the RHP zeros slower than 100 rad/sec for all combinations 
of prospective output variables. The closed-loop bandwidth requirement for 
the aero-engine is approximately 10 rad/sec. RHP zeros close to this value 
or smaller (closer to the origin) will therefore, cause problems and should be 
avoided. It can be seen that the variable OPR2 introduces (relatively) slow 
RHP zeros. It was observed that these zeros move closer to the origin at higher 
thrust levels. Thus Sets 3 and 6 are unfavourable for closed-loop control. This 
along with the minimum singular value analysis leaves us with sets 4 and 5 
for further consideration 

Relative gain array (RGA). We here consider the RGA of the square 
transfer matrices G(s) with three outputs. 


A(G(s)) = G(s) x G-7T(s) (12.14) 


In Section 3.6.2, it is argued that the RGA provides useful information for 
the analysis of input-output controllability and for the pairing of inputs and 
outputs. Specifically input and output variables should be paired so that the 
diagonal elements of the RGA are as close as possible to unity. Furthermore, 
if the plant has large RGA elements and an inverting controller is used, the 
closed-loop system will have little robustness in the face of diagonal input 
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uncertainty. Such a perturbation is quite common due to uncertainty in the 
actuators. Thus we want A to have small elements and for diagonal dominance 
we want A—TI to be small. These two objectives can be combined in the single 
objective of a small RGA-number, defined as 


RGA-number = [JA — I|sum = >| 1— Aig | + 2 | Aas | (12.15) 


i=j iAj 


The lower the RGA number, the more preferred is the control structure. 
Before calculating the RGA number over frequency we rearranged the output 
variables so that the steady-state RGA matrix was as close as possible to the 
identity matrix. 


Set 3 Set 6 
6b 
Set 2 —| 
i a 
4b Set 1 | 
Set 5 
2 Sp ee eee Z Pia Set 4 
0 
10° 10° a 10! 10° 


10 10° 
Frequenc [rad/s] 
Figure 12.10: RGA numbers. 


The RGA numbers for the six candidate output sets are shown in Figure 
12.10. As in the minimum singular value analysis above, we again see that 
Sets 1,2 and 3 are less favourable. Again sets 4 and 5 are the best but too 
similar to allow a decisive selection. 

Hankel singular values. Notice that Sets 4 and 5 differ only in one output 
variable, NL in Set 4 and OPR1 in Set 5. Therefore, to select between them 
we next consider the Hankel singular values of the two transfer functions 
for sets 4 and 5. Hankel singular values reflect the joint controllability and 
observability of the states of a balanced realization (as described in Section 
11.3). The Hankel singular values are invariant under state transformations 
but they do depend on scaling. 

Figure 12.11 shows the Hankel singular values of the two transfer functions 
for Sets 4 and 5 at two operating points: 87% maximum thrust and 80% 
maximum thrust. A second operating point model is used because again the 
difference is not decisive. However, both cases do indicate that Set 5 has 
associated with it better state controllability and observability properties than 
Set 4. In other words, output OPR1 (Set 5) contains more information about 
the system internal states than output NL (Set 4). It therefore seems to be 
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Figure 12.11: Hankel singular values. 


marginally preferable to use OPR1 for control purposes rather than NL in the 
absence of no other information, and hence Set 5 is our final choice. 


12.3.3. A two degrees-of-freedom 1, loop-shaping 
design 


The design procedure given in Section 9.4.3 will be used to design a two 
degrees-of-freedom H.. loop-shaping controller for the 3-input 3-output plant 
G. An 18 state linear plant model G (including actuator dynamics), is available 
over the Internet. It is based on scaling, output selection (Set 5), and input- 
output pairing as described above. To summarize, the selected outputs (Set 
5) are 


e engine inlet pressure, OPR1 
e LP compressor’s exit mach number measurement, LPEMN 
e HP compressor’s spool speed, NH 


and the corresponding inputs are 


e fuel flow, WFE 
e nozzle area, AJ 
e inlet guide vane angle, IGV 


The corresponding steady-state (s = 0) model and RGA-matrix is 


1.076 —0.027 0.004 
G= |-0.064 —0.412 oe | A(G) = 
1.474 —0.093 0.983 


0.004 0.996 —0.000 (12.16) 


1.002 0.004 —0.006 
—0.006 —0.000 1.006 
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Pairing of inputs and outputs. The pairing of inputs and outputs 
is important in that it makes the design of the prefilter simpler in a two 
degree of freedom control configuration. It is of even greater importance if a 
decentralized control scheme is to be used, and gives insight into the working 
of the plant. In Chapter 10 it is argued that negative entries on the principal 
diagonal of the steady-state RGA should be avoided and that the outputs in 
G should be (re)arranged such that the RGA is close to the identity matrix. 
For the selected output set 5, we see from (12.16) that no rearranging of the 
outputs is needed, that is, we should pair OPR1, LPEMN and NH with WFE, 
AJ and IGV, respectively. 

H. loop shaping design. We follow the design procedure given in 
Section 9.4.3. In steps 1 to 3 we discuss how pre- and postcompensators are 
selected to obtain the desired shaped plant (loop shape) Gs; = W2G'W where 
W, = W,W.W%. In steps 4 to 6 we present the subsequent H.. design. 


1. The singular values of the plant are shown in Figure 12.12 (a) and indicate 
a need for extra low frequency gain to give good steady-state tracking 
and disturbance rejection. The precompensator weight is chosen as simple 
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10° 107 
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(a) Plant. (b) Shaped plant. 


Figure 12.12: Singular values for plant and shaped plant. 


integrators, i.e., W, = ts, and the postcompensator weight is selected as 
Wo = Tz. 

2. WoGW, is next aligned at 7 rad/sec. The align gain W, (used in front of 
W,) is the approximate real inverse of the shaped system at the specified 
frequency. The crossover is thus adjusted to 7 rad/sec in order to give 
a closed-loop bandwidth of approximately 10 rad/sec. Alignment should 
not be used if the plant is ill-conditioned with large RGA elements at the 
selected alignment frequency. In our case the RGA elements are small (see 
Figure 12.10) and hence alignment is not expected to cause problems. 
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3. 


An additional gain W, is used in front of the align gain to give some control 
over actuator usage. W, is adjusted so that the actuator rate limits are not 
exceeded for reference and disturbance steps on the scaled outputs. By some 
trial and error, W, is chosen to be diag(1, 2.5, 0.3). This indicates that the 
second actuator (AJ) is made to respond at higher rates whereas the third 
actuator (IGV) is made slower. The shaped plant now becomes G = GW, 
where W, = W,W,W. Its singular values are shown in Figure 12.12 (b). 


- Ymin in (9.66) for this shaped plant is found to be 2.3 which indicates that 


the shaped plant is compatible with robust stability. 


. p is set to 1 and the reference model Treg is chosen as Tyrep = 


diag(sa =e woe a or a ). The third output NH is thus made slower 


than the other two in following reference inputs. 


. The standard H.. optimization defined by P in (9.87) is solved. y iterations 


are performed and a slightly suboptimal controller achieving y = 2.9 is 
obtained. Moving closer to optimality introduces very fast poles in the 
controller, which if the controller is to be discretized, would ask for a very 
high sample rate. Choosing a slightly suboptimal controller alleviates this 
problem and also improves on the Hy performance. The prefilter is finally 
scaled to achieve perfect steady-state model matching. The controller (with 
the weights W, and W) has 27 states. 


12.3.4 Analysis and simulation results 


Step responses of the linear controlled plant model are shown in Figure 12.13. 
The decoupling is good with less than 10% interactions. Although not shown 


/ yy — i yo — ‘i y1 — 
0}! 0.6 / 
Pa ! Ya Ss / yo --- 
0.4 oat 
YR ese 1 YS tot d, YB om 
0.2 o2f / 
' / 
ree ee ee | —— a 
05 1 : 2 0 05 1 15 2 0 0.5 1 15 2 
Time [sec] Time [sec] Time [sec] 
(a) Step in ry. (b) Step in ro. (c) Step in r3. 


Figure 12.13: Reference step responses. 


here the control inputs were analyzed and the actuator signals were found to 
lie within specified limits. Responses to disturbance steps on the outputs were 
also seen to meet the problem specifications. Notice that because there are 
two degrees-of-freedom in the controller structure, the reference to output and 
disturbance to output transfer functions can be given different bandwidths. 
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The robustness properties of the closed-loop system are now analyzed. 


Figure 12.14 (a) shows the singular values of the sensitivity function. The 
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Figure 12.14: Sensitivity and complementary sensitivity functions. 


peak value is less than 2 (actually it is 1.44 = 3.2 dB), which is considered 
satisfactory. Figure 12.14 (b) shows the maximum singular values of T = 
(I-GW,K2)-'GW, Kz and T; = (I—W, K2G)-!W, KG. Both of these have 
small peaks and go to zero quickly at high frequencies. From Section 9.2.2, 
this indicates good robustness both with respect to multiplicative output and 
multiplicative input plant perturbations. 


Nonlinear simulation results are shown in Figure 12.15. Reference signals 
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Figure 12.15: Nonlinear simulation results 


are given to each of the scaled outputs simultaneously. The solid lines show 
the references, and the dash-dot lines, the outputs. It can be seen that the 
controller exhibits good performance with low interactions. 
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12.3.5 Conclusions 


The case study has demonstrated the ease with which the two degrees-of- 
freedom H. loop-shaping design procedure can be applied to a complex 
engineering system. Some tools for control structure design have also been 
usefully applied to the aero-engine example. We stress that a good control 
structure selection is very important. It results in simpler controllers and in 
general, a simpler design exercise. 


12.4 Distillation process 


A typical distillation column is shown in Figure 10.6 on page 454. The 
overall 5 x 5 control problem is discussed in Example 10.6 (page 453) and 
it is recommended to read this first. The commonly used LV- and DV- 
configurations, which are discussed below, are partially controlled systems 
where 3 loops for liquid level and pressure have already been closed. 

For a general discussion on distillation column control, the reader is also 
referred to Shinskey (1984) and Skogestad and Morari (1987a), and to the 
survey paper by Skogestad (1992). 

We have throughout the book studied a particular high-purity binary 
distillation column with 40 theoretical stages (39 trays and a reboiler) plus a 
total condenser. This is “column A” in Skogestad et al. (1990). The feed is 
an equimolar liquid mixture of two components with a relative volatility of 
1.5. The pressure p is assumed constant (perfect control of p using Vr as an 
input). The operating variables (e.g. reflux and boilup rates) are such that 
we nominally have 99% purity for each product (yp and xp). The nominal 
holdups on all stages, including the reboiler and condenser, are Mj*/F = 0.5 
min. The liquid flow dynamics are modeled by a simple linear relationship, 
L,(t) = LS + (M(t) — Mj)/ti, with tz, = 0.063 min is the same on all 
trays. No actuator or measurement dynamics are included. This results in a 
model with 82 states. This distillation process is difficult to control because of 
strong interactions between the two product compositions. More information, 
including steady-state profiles along the column, is available over the internet. 

This distillation process has been used an example throughout the book, 
and rather than repeating these results we here provide a summary. 


Remark 1 The complete linear distillation column model with 4 inputs 
(L,V,D, B), 4 outputs (yo, 2B, Mp, Mp), 2 disturbances (F, zr) and 82 states is 
available over the internet. The states are the mole fractions and liquid holdups on 
each of the 41 stages. By closing the two level loops (Mp and Mz) this model may 
be used to generate the model for any configuration (LV, DV, etc.). For example, 
the model for the LV-configuration is generated using the MATLAB commands in 
table 12.3. 
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Table 12.3: MATLAB program for generating model of 
various distillation configurations. 


¥, Uses MATLAB Mu toolbox 

% G4: State-space model of column (4 inputs, 2 disturbances, 4 outputs, 82 states) 
¥, Level controllers using D and B: 

Kd = 10; Kb = 10; 

¥, P-controllers (bandwidth = 10 rad/min) 

y 

vA 

% Now generate the LV-configuration from G4 using sysic: 

systemnames = ’G4 Kd Kb’; 

inputvar = ’?[L(1); V(1); d(2)]’; 


outputvar = ’[G4(1);G4(2)]’; 
input_to_G4 = ’[L; V; Kd; Kb; d ]’; 
input_to_Kd = °[G4(3)]’; 


input_to_Kb = °[G4(4)]’; 

sysoutname =’Glv’; 

oe eee sysic; 

% Modifications needed to generate DV-configuration: 

Kl = 10; Kb = 10; 

systemnames = ’G4 Kl Kb’; 

inputvar = ’[D(1); V(1); d(2)]’; outputvar = °[G4(1);G4(2)]’; 
input_to_G4 °>[K1; V; D; Kb; d ]’; 

input_to_Kl ?[G4(3)]’; input_to_Kb = ’[G4(4)]’; 

sysoutname =’Gdv’; cleanupsysic=’yes’; sysic; 

y 

h 

% Modifications needed to generate DB-configuration: 

Kl = 10; Kv = 10; 

systemnames = ’G4 Kl Kv’; 

inputvar = ’[D(1); B(1); d(2)]’; outputvar = °[G4(1);G4(2)]’; 
input_to_G4 >[K1; Kv; D; B; d ]°’; 

input_to_K1l ?[G4(3)]’; input_to_Kv = ’[G4(4)]’; 

sysoutname =’Gdb’; cleanupsysic=’yes’; sysic; 
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Remark 2 A 5-state LV-model, obtained by model reducing the above model, is 
given on page 527. This model is also available over the internet. 


12.4.1 Idealized LV-model 


The following idealized model of the distillation process, originally from 
Skogestad et al. (1988), has been used in examples throughout the book: 
G(s) 


1 87.8 —86.4 (12.17) 


~ 75s+1]108.2 —109.6 


The inputs are the reflux (LZ) and boilup (V), and the controlled outputs are 
the top and bottom product compositions (yp and xg). This is a very crude 
model of the distillation process, but it provides an excellent example of an 
ill-conditioned process where control is difficult, primarily due to the presence 
of input uncertainty. 

We refer the reader to the following places in the book where the model 
(12.17) is used: 


Example 3.6 (page 81): SVD-analysis. The singular values are plotted as a 
function of frequency in Figure 3.6(b) on page 83. 


Example 3.7 (page 82): Discussion of physics and interpretation of directions. 


Example 3.11 (page 98): The condition number, 7(G), is 141.7, and the 1, 1- 
element of the RGA, A11(G), is 35.1 (at all frequencies). 


Motivating Example No. 2 (page 101): Introduction to robustness problems 
with inverse-based controller using simulation with 20% input 
uncertainty. 


Exercise 3.7 (page 104): Attempt to robustify above inverse-based design 
using McFarlane-Glover H loop-shaping procedure. 


Etercise 3.8 (page 104): Design of robust SVD-controller. 


Exercise 3.9 (page 104): Combined input and output uncertainty for inverse- 
based controller. 


Example 6.5 (page 247): Magnitude of inputs for rejecting disturbances (in 
feed rate and feed composition). at steady state. 


Example 6.6 (page 257): Sensitivity to input uncertainty with feedforward 
control (RGA). 


Example 6.7 (page 257): Sensitivity to input uncertainty with inverse-based 
controller, sensitivity peak (RGA). 
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Example 6.11 (page 262): Sensitivity to element-by-element uncertainty (rel- 
evant for identification). 


Example 8.1 (page 311): Coupling between uncertainty in transfer function 
elements. 


Example in Section 8.11.3 (page 350): js for robust performance which ex- 
plains poor performance in Motivating Example No. 2. 


Example in Section 8.12.4 (page 358): Design of p-optimal controller using 
Dk-iteration. 


The model in (12.17) has also been the basis for two benchmark problems. 

Original benchmark problem. The original control problem was 
formulated by Skogestad et al. (1988) as a bound on the weighted sensitivity 
with frequency-bounded input uncertainty. The optimal solution to this 
problem is provided by the one degree-of-freedom p-optimal controller given 
in the example in Section 8.12.4 where a peak ji-value of 0.974 (remark 1 on 
page 1) was obtained. 

CDC benchmark problem. The original problem formulation is 
unrealistic in that there is no bound on the input magnitudes. Furthermore, 
the bounds on performance and uncertainty are given in the frequency domain 
(in terms of weighted H.. norm), whereas many engineers feel time domain 
specifications are more realistic. Limebeer (1991) therefore suggested the 
following CDC-specifications. The set of plants II is defined by 


Gays 1 0.878 —0.864] [| kye7%* 0 
~ 75s +1] 1.082 —1.096 0 ke 28 


ki €[0.8 1.2], 6;€[0 1.0] (12.18) 


In physical terms this means 20% gain uncertainty and up to 1 min delay in 
each input channel. The specification is to achieve for for every plant G € II: 


S1: Closed-loop stability. 


S2: Fora unit step demand in channel 1 at t = 0 the plant output y (tracking 
and ye (interaction) should satisfy: 


yi(t) > 0.9 for all ¢ > 30 min. 
yi(t) < 1.1 for all t. 

0.99 < y:(c0) < 1.01. 

yi(t) < 0.5 for all ¢. 

—0.01 < yo(co) < 0.01. 


Corresponding requirements hold for a unit step demand in channel 2. 
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S3: a(K,S) < 0.316, Vw. 
S4: o(GK,) <1 for w > 150. 


Note that a two degrees-of-freedom controller may be used and K, then refers 
to the feedback plant of the controller. In practice, specification $4 is indirectly 
satisfied by $3. Note that the uncertainty description G, = G(I+w ;Ar) with 
wr = ope (as used in the examples in the book) only allows for about 0.9 
min time delay error. To get a weight w7(s) which includes the uncertainty 
in (12.18) we may use the procedure described in Example 7.5, eq. (7.25) or 
(7.26) with r, = 0.2 and Onax = 1. 

Several designs have been presented which satisfy the specifications for the 
CDC-problem in (12.18). For example, a two degrees-of-freedom H.. loop- 
shaping design is given by Limebeer et al. (1993). A two degrees-of- freedom 
L-optimal design is presented by Lundstrém, Skogestad and Doyle (1996). 


12.4.2 Detailed LV-model 


We have also in the book used a 5-state dynamic model of the same distillation 
process which includes liquid flow dynamics (in addition to the composition 
dynamics) as well disturbances. This 5-state model was obtained from model 
reduction of the detailed model with 82 states. The steady-state gains for the 
two disturbances are given in (10.75). 

The 5-state model is similar to (12.17) at low frequencies, but the model 
is much less interactive at higher frequencies. The physical reason is that the 
liquid flow dynamics decouple the response and make G(jw) upper triangular 
at higher frequencies. The effect is illustrated in Figure 12.16 where we show 
the singular values and the magnitude of the RGA-elements are a function of 
frequency. As a comparison, for the simplified model in (12.17), A1i1(G) = 35.1 
at all frequencies (and not just at steady-state). The implication is that control 
at crossover frequencies is easier than expected from the simplified model 
(12.17). 

Applications based on the 5-state model are found in: 


Example 10.8 (page 458): Controllability analysis of partial control and 
feedforward control. 


Example in Section 10.11 (page 474): Controllability analysis of decentral- 
ized control. 


Details about 5-state model. The state-space realization is 


Gai2 Face Cute! | 4 Pa | (12.19) 
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Figure 12.16: Detailed 5-state model of distillation column. 


where 
—.005131 0 0 0 0 —.629 .624 
0 —.07366 0 0 0 .055 —.172 
A= 0 0 —.1829 0 0 , B= | .030 —.108 
0 0 0 —.4620 -9895 —.186 —.139 
0 0 0 —.9895 —.4620 —1.23 —.056 
—0.062 —0.067 
C= eee —.5170 .3386 —.1633 aire | Ba _ A ane ete 
an ’ oa . —0. 
8913 4728 -9876 .8425 .2186 0.188 0.027 
—0.045 0.014 


Scaling. The model is scaled such that magnitude of 1 corresponds to: 0.01 
mole fraction units for each output (yp and xg), the nominal feed flowrate 
for the two inputs (£Z and V), and a 20% change for each disturbance (feed 
rate F and feed composition zr). Notice that the steady-state gains computed 
with this model are slightly different from the ones used in the examples. 


Remark.A similar dynamic LV-model, but with 8 states, is given by Green and 
Limebeer (1995), who also designed 1. loop-shaping controllers. 


Exercise 12.1 Redo the p-optimal design using DK-iteration in Section 8.12.4 
using the model (12.19). 


12.4.3 Idealized DV-model 


Finally, we have also made use of an idealized model for the DV-configuration: 
1 —87. : 
G(s) 87.8 1.4 


~ 75s+1|-108.2 —1.4 
In this case the condition number 7(G) = 70.8 is still large, but the RGA 
elements are small (about 0.5). 


(12.20) 
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Example 6.8 (page 258) Bounds on sensitivity peak show that inverse-based 
controller is robust with respect to diagonal input uncertainty. 


Example 8.6 (page 332): for robust stability with a diagonal controller 
is computed. The difference between diagonal and full-block input 
uncertainty is significant. 


Remark. In practice, the DV-configuration may not be as favourable as indicated 
by these examples, because the level controller at the top of the column is not perfect 
as was assumed when deriving (12.20). 


12.4.4 Further distillation case studies 


The full distillation model, which is available over the internet, may form 
the basis for several case studies (projects). These could include input-output 
controllability analysis, controller design, robustness analysis, and closed-loop 
simulation. The following cases may be considered: 


1. Model with 4 inputs and 4 outputs 

2. LV-configuration (studied extensively in the book) 
3. DV-configuration 

4. DB-configuration 


The models in the latter three cases are generated from the 4 x 4 model 
by closing two level loops (see the MATLAB file in Table 12.3) to get a 
partially controlled plant with 2 inputs and 2 outputs (in addition to the two 
disturbances). 


Remark 1 For the DV- and DB-configurations the resulting model depends quite 
strongly on the tuning of the levelk loops, so one may consider separateley the two 
cases of tight level control (e.g. K = 10 as in Table 12.3) or a loosely tuned level 
controller (e.g. K = 0.5 corresponding to a time constant of 2 min). The level control 
tuning may also be considered as a source of uncertainty. 


Remark 2 The models do not include actuator or measurement dynamics, which 
may also be considered as a source of uncertainty. 


Remark 3 The model for the DB-configuration contains a pure integrator in one 
direction, and the steady-state gain matrix is therefore singular. 
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APPENDIX A 


MATRIX THEORY AND 
NORMS 


Most of the topics in this Appendix are included as background material which is 
needed for the book, and should ideally be studied before reading Chapter 3. 

After studying the Appendix the reader should feel comfortable with a range 
of mathematical tools including eigenvalues, eigenvectors and the singular value 
decomposition, and should appreciate the difference between various norms for 
vectors, matrices, signals and systems, and how these norms can be used to measure 
performance. 

The main references are: Strang (1976) (for understanding), Golub and van Loan 
(1989) (for some details), Horn and Johnson (1985) (for more details) and Zhou 
et al. (1996) (for norms). 


A.1_ Basics 


Let us start with a complex scalar 
c=a+jb, where a = Re(c),b = Im(c) 


To compute the magnitude |c|, we multiply c by its conjugate ¢ 4 a— jb. Then 


Ie| = VJVa= Va Pe = Ja? + b2 
A complex column vector a with m components (elements) is written 
ay 
a2 
am 


T 


where a; is a complex scalar. a’ is used to denote a row vector. 
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Now consider a complex | x m matrix A with elements aj; = Re(ai;) + 
j Im(a;;). lis the number of rows (number of “outputs” when viewed as an 
operator) and m is the number of columns (“inputs”). Mathematically, we 
write A € C'*™ if A is a complex matrix, or A € R!*™ if A is a real matrix. 
Note that a column vector a with m elements may be viewed as an m x 1 
matrix. 

The transpose of a matrix A is A? (with elements aj;;), the conjugate is A 
(with elements Re(a;;) — 7 Im(a;;)), the conjugate transpose (or Hermitian 
adjoint) matrix is A? 4 ArT (with elements Re(aj;;) — jIm/(a;,;)), the trace is 
trA (sum of diagonal elements), and the determinant is det A. The inverse of 
a non-singular matrix is 4~!, and is given by 


_1 _ adjA 
~ det A 


(A.1) 


where adjA is the adjugate (or “classical adjoint”) of A which is the 
transposed matrix of cofactors of A. Here the ij’th cofactor of A is 


cij = [adj A]; 2 (—1)*/ det 44 (A.2) 


where A” is a submatrix formed by deleting row i and column j of A. For 
example, for a 2 x 2 matrix we have 


ai, 12 
A= ; det A = a11A22 — a12421 
a2, G22 


he 1 a22 —a12 
A Gee aoe ae (A.3) 


We also have, assuming the inverses exist, that: 


(AB)? = BT AT, (AB)? =B" A", (AB) 1=B 1A (A.4) 


A.1.1 Some useful matrix identities 


Lemma A.1 The matrix inversion lemma. Let A;,A2,A3 and A, be 
matrices with compatible dimension such that the matrices A2A3A4 and 
(Ay + A2A3A4) are defined. Also assume that the inverses given below exist. 
Then 


(Ay + Ay A3 Aq)! = AS as Ay’ As (Ag Az "Ao + A3 ia "As Ay (A.5 


Proof: Postmultiply the right hand side in (A.5) with Ai + A2A3Au4. This gives the 
identity matrix. 
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Lemma A.2 Inverse of partitioned matrix. If Ajj) and X~! exist then 


Avy. As). WARS Ar AN Ay AS aA (A.6) 
= a1 4A! Ko} 


where X = Ago — Ao Ay Ate is the Schur complement of Ay, in A; also see 
(A.14). Similarly if Ajy and Y~" exist then 


ae roe y" ‘ ~Y'ApAgy (A.7) 
—Aj553 An Y~' A5y + Ady Ant Y~1A12 Ady 


where Y = Ay, — Aj2 A559 Aor is the Schur complement of Azo in A; also see 
(A. 15). 


A.1.2. Some determinant identities 


The determinant is defined only for square matrices, so let A be ann x n 
matrix. A matrix is non-singular if det A is non-zero. The determinant may 
be defined inductively as det A = )7j_, aijeij (expansion along column j) or 
det A = pe aijcij (expansion along row i), where c;j is the ij’th cofactor 
given in (A.2). This inductive definition begins by defining the determinant of 
an 1x 1 matrix (a scalar) to be the value of the scalar, i.e. det a = a. We then 
get for a 2 x 2 matrix det A = a4,@22 — a 2a; and so on. From the definition 
we directly get that det A = det A7. Some other determinant identities are 


given below: 


1. Let A be a non-singular matrix. Then 

det A~' = 1/det A (A.8 
2. Let c be a complex scalar and A an n x n matrix. Then 

det(cA) = c” det(A) (A.9 


3. Let A; and A» be square matrices of the same dimension. Then 


det (Aj Az) = det (A241) = det Ay - det A» (A.10 


4. Let A; and As be matrices of compatible dimensions such that both 
matrices Ay Ay and AA; are square (but A; and A» need not themselves 
be square). Then 


This is actually a special case of Schur’s formula given next. (A.11) is 
important in control area as it yields, e.g., det(I + GK) = det(I+ KG). 
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5. The determinant of a triangular or block-triangular matrix is the product 
of the determinants of the diagonal blocks: 


Au Ai2| _ Ay O | _ 
act 0 | = det bs ie = det (Ai) det (A22) (A.12) 


6. Schur’s formula for the determinant of a partitioned matrix: 


det be | = det(Aj1) - det(Az2 — Aoi Aj} Ai2) 
= det (A22) i det(Ai1 aa Ajo A55 Ao1) (A.13) 
where it is assumed that A,, and/or Ago are non-singular. 


Proof: Note that A has the following decomposition if Aj is nonsingular: 


Au Ai} _ I 0] [Aun oO] | 2 Ap Ar 
a | 7 asta "| 0 3 ; a (4) 


where X = Aono — Ao Aj} At. Computing the determinant of A using (A.10) and 
(A.12) then proves the first part of (A.13). Similarly, if Ao2 is nonsingular, 


Au Aw] _ [Tt Awd |[Y 0 I 0 (A.15 
Aa Ao2 > 0 z 0 Ago Ajy Ant I ‘ 


where Y = Aj, — A12A5q Aor, and the last part of (A.13) follows. 


A.2 Eigenvalues and eigenvectors 


Definition A.3 Eigenvalues and eigenvectors. Let A be a square n x n 
matriz. The eigenvalues \;, i =1,...,n, are the n solutions to the n’th order 
characteristic equation 

det(A — AT) = 0 (A.16) 


The (right) eigenvector t; corresponding to the eigenvalue A; ts the nontrivial 
solution (t; #0) to 


The corresponding left eigenvectors q; satisfy 
gGi(A-AT)=0 & qflA=rof (A.18) 


When we just say eigenvector we mean the right eigenvector. 
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The eigenvalues are sometimes called characteristic gains. The set of 
eigenvalues of A is called the spectrum of A. The largest of the absolute 


values of the eigenvalues of A is the spectral radius of A, p(A) 2 max; |\;(A)]. 

Note that if t is an eigenvector then so is at for any constant a. Therefore, 
the eigenvectors are usually normalized to have unit length, i.e. t/t; = 1. An 
important result for eigenvectors is that eigenvectors corresponding to distinct 
eigenvalues are always linearly independent. For repeated eigenvalues, this 
may not always be the case, that is, not all n x n matrices have n linearly 
independent eigenvectors (these are the so-called “defective” matrices). 

The eigenvectors may be collected as columns in the matrix T and the 
eigenvalues 1, A2,.--, An as diagonal elements in the matrix A: 


Toa Ati tos obey ty fi A = diag{Aq, A2,---,An} (A.19) 
Then we may write (A.17) in the following form 
AT=TA (A.20) 


Let us now consider using T for “diagonalization” of the matrix A when 
its eigenvectors are linearly independent such that T+ exists (which always 
happens if the eigenvalues are distinct, but may also happen in other cases, e.g. 
for A = I). From (4.20) we then get that the eigenvector matrix diagonalizes 
A: 

A=T'AT (A.21) 


A.2.1 Facts for eigenvalues: 


Let in the following \; denote the eigenvalues of A. 


1. The sum of the eigenvalues of A is equal to trace of A (sum of the diagonal 
elements): trA = 0, ;. 

2. The product of the eigenvalues of A is equal to the determinant of A: 
det A = TI, rj- 

3. The eigenvalues of an upper or lower triangular matrix are equal to its 
diagonal elements. 

4. For a real matrix the eigenvalues are either real, or occur in complex 
conjugate pairs. 

5. Aand A” have the same eigenvalues (but different eigenvectors). 

6. The inverse A~? exists if and only if all eigenvalues of A are non-zero. In 

this case A~+ has the eigenvalues 1/A1,...,1/An- 

. The matrix A+ cl has eigenvalues A; + c. 

. The matrix cA* where k is an integer has eigenvalues c\. 

9. Consider the | x m matrix A and the m x | matrix B. Then the | x | 
matrix AB and the m x m matrix BA have the same non-zero eigenvalues. 


oa | 
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To be more specific assume | > m. Then the matrix AB has the same m 
eigenvalues as BA plus | — m eigenvalues which are identically equal to 
zero. 

10. The eigenvalues are invariant under similarity transformations, that is, A 
and DAD~—' have the same eigenvalues. 

11. The same eigenvectors diagonalize the matrix A and the matrix (I+ A)7}. 
( 
em Proof: T~'(I + A)-'T =(T7'(I+ A)T) + = (I+ A)". ) 

12. Gershgorin’s theorem The eigenvalues of the n x n matrix A lie in the 
union of n circles in the complex plane, each with centre a;; and radius 
ri; = )0j4i |@i3| (sum of off-diagonal elements in row 7). They also lie in the 
union of n circles, each with centre aj; and radius r; = )7 4; |aji| (sum of 
off-diagonal elements in column ?). 


From the above properties we have, for example, that 


iL, 1 


MS) EEE) oy oe 


(A.22) 


In this book we consider eigenvalues of two completely different matrixes, 
namely of the real state matrix, A, and of the complex transfer matrix at a 
given frequency, e.g. L(jw), and it is important to realize the difference. This 
is discussed next. 


A.2.2 Eigenvalues of the state matrix 


Consider a system described by the linear differential equations: 
& = Ar+ Bu (A.23) 


Unless A is diagonal this is a set of coupled differential equations. For 
simplicity, assume that the eigenvectors of A are linearly independent and 
introduce the new state vector z = T~'z, that is, x = Tz. We then get 


Tz=ATz+Bu @& #=Az+T'Bu (A.24) 


which is a set of uncoupled differential equations in terms of the new states 
z = Tx. The unforced solution (i.e., with u = 0) is z; = z;e>** where 29; is 
the value of the state at t= 0. If \; is real then we see that we have stability 
of this mode (z; > 0 as t > oo) if and only if A; < 0. If A; = Red; + jIm); 
is complex then we get e*! = eR" (cos(ImAjt) + j sin(Im),;t)) and we have 
stability (z; > 0 as t > oo) if and only if Red; < 0 (the fact that the new 
state z; is complex is of no concern since the real physical states x = Tz of 
course are real). This is a general result: A linear system is stable if and only 
if all the eigenvalues of the state matrix A have real parts less than 0, that is, 
lie in the open left-half plane. 
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A.2.3 Ejigenvalues of transfer functions 


The eigenvalues of the loop transfer function matrix, ;(L(jw)), are sometimes 
called the characteristic loci, and they may to some degree be used to 
generalize L(jw) for a scalar system. We make use of A;(L) to study the 
stability of the MA-structure in Chapter 8 where L = MA. Even more 
important in this context is the spectral radius, p(L) = max; |A;(L(jw))|. 


A.3 Singular Value Decomposition 


Definition A.4 Unitary matrix. A (compler) matrix U is unitary if 
vu =u (A.25) 


All the eigenvalues of a unitary matrix have absolute value equal to 1, and 
all its singular values are therefore (as we shall see from the definition below) 
equal to 1. 


Definition A.5 SVD. Any complex 1 xm matrix A may be decomposed into 
a singular value decomposition: 


A=Uxv" (A.26) 


where the | x 1 matrit U and the m x m matriz V are unitary, andl xm 
matriz % contains a diagonal matriz %, with the real, non-negative singular 
values, oi, arranged in a descending order as in 


eal l>m (A.27 
or 
S=[EY, 0]; l<m (A.28 
where 
¥y = diag{o1,02,...,0%}; k= min{l,m} (A.29 
and 
ae ae eee eae (A.30 


The unitary matrices U and V form orthonormal bases for the column 
(output) space and the row (input) space of A. Here the column vectors of V, 
denoted v;, are called right or input singular vectors and the column vectors 
of U, denoted u;, are called left or output singular vectors. 

Note that this decomposition is not unique since A = ulavet where 
U' = US, V' = VS~' and S = diag{e/%} and 6; is any real number, is 
also an SVD of A. However, the singular values, o;’, are unique. 
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The singular values are the square roots of the k = min(l,m) largest 
eigenvalues of both AA” and A# A. We have 


oi(A) = yf di(AMA) = /A,(AA") (A.31) 


Also, the columns of U and V are unit eigenvectors of AA” and A” A, 
respectively. To derive (A.31) write 


AAP = (USV")(USV")# = (USV"4)(VEZUF) =USE"UF (A382) 
or equivalently since U is unitary and satisfies U" = U7!: 
(AA )U = UD" (A.33) 


We then see that U is the matrix of eigenvectors of AA” and {o?} are its 
eigenvalues. Similarly, we have that V is the matrix of eigenvectors of A” A. 


A.3.1 Rank 


Definition A.6 The rank of a matrix is equal to the number of non-zero 
singular values of the matrit. Let rank(A) = r, then the matriz A is called 
rank deficient if r < k = min(l,m), and we have singular values o; = 0 for 
i=rt+l,...k. A rank deficient square matriz is a singular matriz (non-square 
matrices are always singular). 


The rank is unchanged upon left or right multiplication by a non-singular 
matrix. Furthermore, for an | x m-matrix A and an m x p-matrix B, the rank 
of their product AB is bounded as follows (Sylvester’s inequality) 


rank(A) + rank(B) —_m < rank(AB) < min{rank(A),rank(B)} — (4.34) 


A.3.2 Singular values for 2 x 2 matrix 


In general, the singular values must be computed numerically. For 2 x 2 
matrices an analytic expression is easily derived. Introduce 


bStr(A¥ A) = So |aijl?, ¢ 2 det(A” A) 
aij 


Furthermore, the sum of the eigenvalues of a matrix is equal to its trace and 
the product is equal to its determinant, so 


Ay + A2 = 4, Ay: Ag =e 
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Solving with respect to Ay and Ag, and using o;(A) = \/A;(A” A) then yields: 
b+ /b — 4c Ee 4 
aeaar vo 25, cee (A.35) 
1 2 ; 
For example, for A = 3 4| we have 6 = Yo la,;/> = 1+44+9+4 16 = 30, 


= (det A)? = (—2)? = 4, and we get G(A) = 5.465 and o(A) = 0.366. 
Note that for singular 2 x 2 matrices (with det A = 0 and o(A) = 0) we get 


a(A) = J/> |aij|?) = ||Allz (the Frobenius norm), which is actually a special 
case of (A.124). 


A.3.3. SVD of inverse 
Since A= UNV" we get, provided the m x m_A is non-singular, that 


At=v»-lu"# (A.36) 


This is the SVD of A~! but with the order of the singular values reversed. 
Let 7 =m —i+1. Then it follows from (4.36) that 


oi(A*) =1/o;(A), ui(A7*) = 0;(A),  v:(A7*) = uy (A) (A.37) 


and in particular 


a(A~") = 1/a(A) (A.38) 


A.3.4 Singular value inequalities 


The singular values bound the magnitude of the eigenvalues (also see (A.114)): 
o(A) <|Ai(A)| < aA) (A.39 
The following is obvious from the SVD-definition: 
a(A") =a(A) and a(AT) =a(A) (A.40 
This important property is proved below (eq. A.95): 
(AB) < a(A)a(B) (A.41 
For a non-singular A (or B) we also have a lower bound on a(AB) 
a(A)a(B) <a(AB) or G(A)o(B) < a(AB) (A.42 
We also have a lower bound on the minimum singular value 


o(A)o(B) < (AB) (A.43 
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For a partitioned matrix the following inequalities are useful: 


max{a(A),a(B)} <a zi < V2max{a(A),a(B)} (A.44) 


5 ‘| < (A) +.a(B) (A.45) 


The following equality for a block-diagonal matrix is used extensively in the 
book: 


5 ls zl = max{a(A),a(B)} (A.46) 


Another very useful result is Fan’s theorem (Horn and Johnson, 1985, p. 140 
and p. 178): 


oi(A) — @(B) <oi(A +B) < oi(A) +4(B) (A.47) 


Two special cases of (A.47) are: 


|o(A) — a(B)| < (A+ B) <a(A) +4(B) (A.48) 
o(A) — a(B) < o(A+B) <a(A) +4(B) (A.49) 

(A.49) yields 
a(A) -1<o0(1+A) <a(A)4+1 (A.50) 


On combining (A.38) and (A.50) we get a relationship that is useful when 
evaluating the amplification for closed-loop systems: 


o(A) -1< ——__ <a(A)+1 (A.51) 


A.3.5 Singularity of matrix A+ E 
From the left inequality in (A.49) we find that 


a(E) < a(A) (A.52) 


implies that o(A+) > 0, and thus the smallest singular value o(A) measures 
how near the matrix A is to being singular or rank deficient. This test is 
often used in numerical analysis, and it is also an important inequality in the 
formulation of various robustness tests. 
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A.3.6 SVD as sum of rank 1 matrices 


Let r denote the rank of the 1 x m matrix A. We may consider then the SVD 
as a decomposition of A into r 1 x m matrices, each of rank 1. We have 


A=UIV¥ =o oul? (A.53) 
i=1 


(the remaining terms from r+ 1 to & = min{l,m} have singular values equal 
to 0 and give no contribution to the sum). The first and most important 
submatrix is given by A; = oyu! If we now consider the residual matrix 


Al =A-—A,=A-oyuvl! (A.54 
then it may be shown that 
O1 (A!) = 02 (A) (A.55 


that is, the largest singular value of A’ is equal to the second singular value 
of the original matrix. This shows that the direction corresponding to o2(A 
is the second most important direction, and so on. 


A.3.7 Economy size SVD 


Since there are only r = rank(A) < min{l,m} non-zero singular values, and 
since only the non-zero singular values contribute to the overall product, the 
singular value decomposition of A is sometimes written as an economy size 
SVD 

Alxm = BRIERE (yrs (A.56) 


where the matrices U; and V, contain only the r first columns of the matrices 
U and V introduced above. Here we have used the notation A!*™ to indicate 
that A is an 1 x m matrix. 


Remark. The “economy size SVD” presently used in MATLAB is not quite as 


economic as the one given in (A.56) as it uses m instead of r. 


A.3.8 Pseudo-inverse (Generalized inverse) 


Consider the linear set of equations 
y = Av (A.57) 


with a given 1 x 1 vector y and a given 1 x m matrix A. A least squares solution 
to (A.57) is an m x 1 vector x such that ||z||o = \/27 + 73 --- a2, is minimized 
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among all vectors for which ||y — Ax||2 is minimized, and is given in terms of 
the pseudo-inverse (Moore-Penrose generalized inverse) of A: 


c= Aly (A.58) 
The pseudo-inverse may be obtained from an SVD of A= USV# by 


Tr 


1 
At = V5 UF => PaeY vjull (A.59) 
i-1 °* 


where r is the number of non-zero singular values of A. We have that 
o(A) =1/a(A") (A.60) 


Note that At exists for any matrix A, even for a singular square matrix and 
a non-square matrix. The pseudo-inverse also satisfies 


AA'A=A and ATAAT = At 
Note the following cases (r is the rank of A): 


1. r=l=™m,ie., Ais a non-singular. In this case At = A7! is the inverse of 
the matrix. 

2.r =m <1,ie., A has full column rank. This is the “conventional least 
square’s problem” where we want to minimize ||y— Az|l2, and the solution 
is 

At = (A A)-1A# (A.61) 
In this case AtA = J so Al is a left inverse of A. 

3. r=l<™m,i.e., A has full row rank. In this case we have an infinite number 

of solutions to (A.57) and we seek the one that minimizes ||2||2. We get 


At = AM (AA¥)"1 (A.62) 


In this case AAt = I so Af is a right inverse of A. 

4. r < k = min{l,m} (general case). In this case both matrices A” A and 
AA® are rank deficient, and we have to use the SVD to obtain the pseudo- 
inverse. In this case A has neither a left or right inverse. 


Principal component regression (PCR) 


We note that the pseudo-inverse in (A.59) may be very sensitive to noise and 
“blow up” if the smallest non-zero singular value, 0, is small. In the PCR 
method one avoids this problem by using only the q < r first singular values 
which can be distinguished from the noise. The PCR pseudo-inverse then 


becomes 
q 


1 
Avor = Ss —vjuj! (A.63) 


i=1 °? 
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Remark. This is similar in spirit to the use of Hankel singular values for model 
reduction. 


A.3.9 Condition number 


The condition number of a matrix is in this book defined as the ratio 
(A) = 01(A)/on(A) = (A)/o(A) (A.64) 


where & = min(/,m). A matrix with a large condition number is said to be 
ill-conditioned. This definition yields an infinite condition number for rank 
deficient matrices. For a non-singular matrix we get from (A.38) 


¥(A) = (A) -3(4-) (A.65) 


Other definitions for the condition number of a non-singular matrix are also 
in use, for example, 
p(A) = |All AI (A.66) 


where ||A|| denotes any matrix norm. If we use the induced 2-norm (singular 
value) then this yields (A.65). From (A.65) and (A.41) we get for non-singular 
matrices 


(AB) < y(A)(B) (A.67) 


Note that the condition number depends strongly on the scaling of the inputs 
and outputs. To be more specific let y’ = Doy and u’ = Dru where the scaling 
matrices Do and D; are diagonal and real. Then the condition numbers of the 
matrices A and Do AD; may in general be arbitrarily far apart. In general, 
the scalings Do and D; should be selected on physical ground, for example, 
by dividing each input and output by its largest expected or desired value as 
discussed in Chapter 1.4. 

The minimized condition number is obtained by minimizing the 
condition number over all possible scalings. We have 


7"(G) 2 min y(DoGDr) (A.68) 


where D; and Do are diagonal scaling matrices. For a 2 x 2 matrix, the 
minimized condition number is given by (Grosdidier et. al., 1985): 


V(G) = [Alla + Al -1 (A.69) 


where ||A|l|;; is the induced 1-norm (maximum column sum) of the RGA- 
matrix of G. Note that, A(G) = I and y*(G) = 1 for a triangular 2 x 2 
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matrix. If we allow only scaling on one side then we get the input and output 
minimized condition numbers: 


W(@) 2 miny(GD;), 7G(G) 2 miny(DoG) (A.70) 


To compute these minimized condition numbers we define 


0 Gt 
n= [2 & es 
Then we have as proven by Braatz and Morari (1994) 
y*(G) = main o(DHD~'), D=diag{D;,Do} (A.72) 
I;,VO 


4/y(G) =mina(DHD~'), D=diag{D,,I} (A.73) 


Dr 
\/¥Oo(G) = min a(DHD~'), D=diag{I,Do} (A.74) 


These convex optimization problems may be solved using available software 
for the upper bound on the structured singular value a(H), see (8.95). In 
calculating wa(H), we use for y*(G) the structure A = diag{Agiag, Adiag}, 
for y7(G) the structure A = diag{Adiag, Arun}, and for y6(G) the structure 
A = diag{Afun, Adiag }- 


A.4_ Relative Gain Array 


The RGA was originally introduced by Bristol (1966) to measure steady-state 
interactions caused by decentralized diagonal control, but the RGA also has a 
number of important properties as a frequency-dependent function. Many of 
these properties were stated by Bristol, but they were not proven rigorously 
until the work by Grosdidier et al. (1985). Some additional properties are 
given in Hovd and Skogestad (1992). 

The Relative Gain Array of a complex non-singular m xm matrix A, denoted 
RGA(A) or A(A), is a complex m x m matrix defined by 


RGA(A) = A(A) 2 Ax (472)7 (A.75) 


where the operation x denotes element by element multiplication (Hadamard 
or Schur product). If A is real then A(A) is also real. Example: 


fi =2) ja [04 02 _ [04 0.6 
A= [5 1]. 4 Ser: FpeAG Oca ye wal 
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A.4.1. Properties of RGA-matrix 


Most of the following properties follow directly if we write the RGA-elements 
in the following form 


fe Cj 


ig EY aij det AM 
ve det A 


det A 


rij = aiyj:a 1) (A.76) 
Here aj; denotes the ji’th element of the matrix A7!, A denotes the matrix 
A with row i and column j deleted, and ¢; = (—1)'*/ det AY is the ij’th 
cofactor of the matrix A. 

For any non-singular m x m matrix A the following holds: 


1. A(A7!) = A(AT) = A(A)T 

2. Any permutation of the rows and columns of A results in the same 
permutation in the RGA. That is, A(P; AP2) = P,A(A)P2 where P; and P 
are permutation matrices. (A permutation matrix has a single 1 in every 
row and column and all other elements equal to 0.) A(P) = P for any 
perturbation matrix. 

3. The sum of all elements of each row and each column of the RGA is 1. 
That is, Wei rij =1and ae rij =; 

4. A(A) = I if and only if A is a lower or upper triangular matrix; and in 
particular the RGA of a diagonal matrix is the identity matrix. 

5. The RGA is scaling invariant. More precisely, A(D;AD2) = A(A) where 
D, and Dg are diagonal matrices. 

6. The RGA is a measure of sensitivity to relative element-by-element 
uncertainty in the matrix. More precisely, the matrix A becomes singular 
if we make a relative change —1/,; in its ij’th element, that is, if a single 
element in A is perturbed from a;,; to a’j; = a;j(1 — es ). 

7. The norm of the RGA is closely related to the minimized condition number, 
y*(A) = minp, ,p, y(D1 AD2) where D; and Dg is any diagonal matrix. We 
have the following lower and conjectured upper bound on *(A) 


1 
y*(A) 


I|Allm — <7°(A) S |lAllsum + &(m) (A.77) 


where k(m) is a constant, and ||A||m, a 2max{|lAllez, || Allico} and ||Allsum = 
4, Aj] (the matrix norms are defined in Section A.5.2). The lower 
bound is proven by Nett and Manousiouthakis (1987). The upper bound 
is proven for 2 x 2 matrices with k(2) = 0 (Grosdidier et al., 1985), but 
it is only conjectured for the general case with k(3) = 1 and k(4) = 2 
(Skogestad and Morari, 1987c; Nett and Manousiouthakis, 1987). Note 
that ||Allm < ||Allsum where the equality always holds in the 2 x 2 case. 
Consequently, for 2 x 2 matrices y* and ||A||sum are always very close in 
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magnitude (also see (A.69)): 


1 
y*(A) 


2x2 matrix: ||Allsum — <7*(A) < |[Allsum (A.78) 


8. The diagonal elements of the matrix ADA™! are given in terms of the 
corresponding row-elements of the RGA. (Skogestad and Morari, 1987c). 
More precisely, for any diagonal matrix D = diag{d;} we have 

[ADA~]is = [A“'DA]is = S> aj (A)d; (A.79) 


god 


9. It follows from Property 3 that A always has at least one eigenvalue and 
one singular value equal to 1. 


Proof of some of the properties: Property 3: Since AA~' = I it follows that 

a aij4;; = 1. From the definition of the RGA we then have that yy A = 1. 
Property 4: If the matrix is upper triangular then a;; = 0 for 7 > j. It then follows 
that c;; = 0 for j > 7% and all the off-diagonal RGA-elements are zero. Property 5: 
Let A’! = Di AD». Then aj; = diidajai; and ay = acme and the result follows. 
Property 6: The determinant can be evaluated by expanding it in terms of the any 
row or column, e.g. by row i, det A = )>,(—1)'*a;; det A”. Let A’ denote A with 
a';; substituted for a;;. By expanding the determinant of A’ by row 7 and then using 


(A.76) we get 


det A’ = det A — (—1)'*? i det AY =0 
aj 
———— 
det A 


Property 8: The ii’th element of the matrix B = ADA™! is bi = Nee djaija;; = 


> dj Xj. 


Example A.1 


56 66 75 97 6.16 069 —7.94 3.48 
75 54 82 28 -1.77 0.10 3.16 —0.49 
42= 11g 66 95 38]' S44) =| _660 173 855 —2.69 


9 51 8 11 3.21 —0.14 —2.77 0.70 


(A.80) 


In this case y(A2) = G(A2)/a(A2) = 207.68/1.367 = 151.9 and y*(A2) = 51.73 
(obtained numerically using (A.72)). Furthermore, ||A||m = 2max{22.42, 19.58} = 
44.84, and ||Allsum = 50.19, so (A.77) with k(m) = k(4) = 2 is satisfied. The matrix 
Az is non-singular and the 1,3-element of the RGA is \13(A2) = —7.94. Thus from 
Property 6 the matriz Az becomes singular if the 1,3-element is perturbed from 75 
to 75(1 — ) =84.45. 


aah 
7.94 
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A.4.2. RGA of non-square matrix 


The RGA may be generalized to a non-square | x m matrix A by use of the 
pseudo inverse At defined in (A.59). We have 


A(A) = Ax (At)? (A.81) 


Properties 1 and 2 (permutations) of the RGA also hold for non-square 
matrices, but the remaining properties do not apply in the general case. 
However, they partly apply if A is either full row rank or full column rank. 


1. A has full row rank, i.e., r = rank(A) =1 (i.e. A may have more inputs than 
outputs, and the outputs are linearly independent). In this case AAt = TJ, 
and the following properties hold 

(a) The RGA is independent of output scaling, i.e. A(DA) = A(A). 

(b) All rows of the RGA sum to 1, 04" Ay = 1. 

(c) Column j of the RGA sums to the square of the 2-norm of the j’th row 
in V,, 


l 

Do Ag = leP Ve <1 (A.82) 

i=1 
Here V, contains the first r input singular vectors for G, and e; is an 
m x 1 basis vector for input uj; e; =[0--- 0 1 0 --- O]" where 1 
appears in position 7. 

(d) The diagonal elements of B = ADA? are by = 4 djaija;; 

et d; ij, where a;, denotes the ji’th element of At and D is any 
diagonal matrix. 


2. A has full column rank, i.e., r = rank(A) = m (i.e. A may have fewer 
inputs than outputs, and the inputs are linearly independent). In this case 
AtA=T, and the following properties hold 

(a) The RGA is independent of input scaling, i.e. A(AD) = A(A). 
(b) All columns of the RGA sum to 1, se Aj = 1. 


(c) Row i of the RGA sums to the square of the 2-norm of the i’th row in 
U,, 


m 
Yj = NleF UB <1 (A.83) 
i=1 
Here U;, contain the first r output singular vectors for G, and e; is an 
Lx 1 basis vector for output yj; e; =[0--- 0 1 0--- OJ’ where 1 
appears in position 2. 
% L = 
(d) The diagonal elements of B = AtDA are equal to bj; = 0-1 a;j;diaijy = 


Was. d;\;;, where a,; denotes the ji’th element of At and D is any 


diagonal matrix. 
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3. General case. For a general square or non-square matrix which has neither 
full row or full column rank, identities (A.82) and (A.83) still apply. 
From this it also follows that the rank of any matrix is equal to the sum 
of its RGA-elements: Let the 1 x m matrix G have rank r, then 


S> Aig (G) = rank(@) = r (A.84) 


Proof of (A.82) and (A.83): We will prove these identities for the general case. 
Write the SVD of G as G = U,=,V," (this is the economy size SVD from (A.56)) 
where ¥, is invertible. We have that gi; = ef U, dy V," e;, [@*],, -_ el V,Dn Us ei, 
Un U, =I, and V,"V, = I, where I, denotes identity matrix of dim r x r. For the 
row sum (A.83) we then get 


m m 

= S- H HoH —1y77H 
rij = ey U, dV, e€j;€3 Va ds U; Ee, = 

j=l joi 


m 
H H H ee H H Hy 2 
e; Up dV, } eje; Vr, U, es =e; U,U, ex = le; U>|l3 
j=l 
nn 
Im 


and the result for the column sum (A.82) is proved in a similar fashion. 


Remark. The extension of the RGA to nonsquare matrices was done by Chang and 
Yu (1990) who also stated most of its properties, although in a somewhat incomplete 
form. More general and precise statements are found in e.g. Cao (1995). 


A.4.3. Computing RGA with MATLAB 


If G is a constant matrix then the RGA can be computed using: 
RGA = G.*pinv(G.’); 

If G(jw) is a frequency-dependent matrix generated using the pu toolbox, e.g., 
G=pck(A,B,C,D); omega=logspace(-2,2,41); Gw=frsp(G, omega); 

then the RGA as a function of frequency can be computed using: 


RGAW = veval(’.*’ ,Gw,vpinv(vtp(Gw))); 


A.5 Norms 


It is useful to have a single number which gives an overall size of a vector 
or a matrix, or of a signal or a system. For this purpose we use functions 
which are called norms. In everyday life, the most commonly used norm is the 
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Euclidean vector norm, |le||2 = \/Je1|? + |e2|2 +--+ |em|?. This is simply the 
distance between two points y and x, where e; = y; — x; is the difference in 
their coordinates. 


Definition A.7 A norm of e (which may be a vector, matriz, signal or 
system) is a real number, denoted |le||, that satisfies the following properties. 


1. Non-negative: ||e|| > 0. 
2. Positive: |le|| =0 << e =0 (for seminorms we have |le|| =0 <= e = 0). 
3. Homogeneous: ||a - el| = |a| - |le|| for all complez scalars a. 
4. Triangle inequality: 
Ilex + €2[| < llexll + lleall (A.85) 


More precisely, e is an element in a vector space V over the field C of complex 
numbers, and the properties above must be satisfied Ve,e1,e2 € V andVa €C. 


We will consider the norms of four different objects (norms on four different 
vector spaces): 


1. e is a constant vector. 

2. e is a constant matrix. 

3. € is a time dependent signal, e(t), which at each fixed t is a constant scalar 
or vector. 

4. eis a “system”, a transfer function G(s) or impulse response g(t), which 
at each fixed s or t is a constant scalar or matrix. 


Cases 1 and 2 above involve spatial norms and the question is: “How do we 
average or sum up the channels?”. Cases 3 and 4 involve function norms or 
temporal norms where we want to “average” or “sum up” as a function of 
time or frequency. Note that the first two are finite dimensional norms, while 
the latter two are infinite-dimensional. 

A remark on notation. The reader should be aware that the notation on 
norms in the literature is not consistent, and one must be careful to avoid 
confusion. First, in spite of the fundamental difference between spatial and 
temporal norms, the same notation, ||-||, is generally used for both of them, and 
we shall adopt that here. Second, the same notation is often used to denote 
entirely different norms. For example, consider the infinity-norm, |lelloo. If 
e is a constant vector, then |le||,. is the largest element in the vector (we 
often use |lel| max for this). If e(t) is a scalar time signal, then |le(t)||.. is the 
peak value of |e(t)| as a function of time. If E is a constant matrix then 
||E||o0 may denote the the largest matrix element (we use ||A||max for this), 
while other authors use ||E||.. to denote the largest matrix row-sum (we use 
||E|licoo for this). Finally, if E(s) is a system (transfer function), then ||E|loo 
is the H,. norm which is the peak value of the maximum singular value of EF, 
|E(s)|loo = max, F(E(jw)) (which is how we mostly use the oo-norm in this 
book). 
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A.5.1 Vector norms 


We will consider a vector a with m elements, that is, the vector space is 
V =C™. To illustrate the different norms we will calculate each of them for 
the vector 


by 1 
b=| a |={ 3 (A.86) 
bs —5 


We will consider three norms which are special cases of the vector p-norm 
llallp = (So laal?)*” (A.87) 
i 


where we must have p > 1 to satisfy the triangle inequality (property 4 of a 
norm). Here a is a column vector with elements a; and |a;| is the absolute 
value of the complex scalar a;. 

Vector 1-norm (or sum-norm ). This is sometimes referred to as the “taxi- 
cab norm”, as in two dimensions it corresponds to the distance between two 
points when following the “streets”. We have 


A 
lla = So lai] (([|1 =1+3+5=9) (A.88) 


Vector 2-norm (Euclidean norm). This is the most common vector 
norm, and corresponds to the shortest distance between two points 


llall2= [So lal? (blo = VIF 9+ 2 = 5.916) (A.89) 


The Euclidean vector norm satisfies the property 


aa = lal’ (A.90) 


where a” denotes the complex conjugate transpose of the vector a. 
Vector oo -norm (or max norm ). This is the largest magnitude of the 
elements in the vector, and we usually use the notation ||@||max for this norm. 


A 
lla|]o0 = llalmax = max lai] ([[b|lmax = | — 5] = 5) (A.91) 


Since the various vector norms only differ by constant factors, they are often 
said to be equivalent. For example, for a vector with m elements 


llallmax < [lalla < Vm |lall max (A.92) 
lalla <llall: < Vim |lalls (A.93) 


In Figure A.1 the differences between the vector norms are illustrated by 
plotting the contours for ||a||p = 1 for the case with m = 2. 
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Figure A.1: Contours for the vector p-norm, ||a||p = 1 for p = 1, 2,00 


A.5.2. Matrix norms 


We will consider a constant 1 xm matrix A. Typically, A may be the complex 
matrix G(jw) evaluated at a given frequency w, which may represent the 
frequency response of a system with m inputs and / outputs. For numerical 
illustration we shall consider the following 2 x 2 matrix 


1 2 
a=[3, 3 ren 
Definition A.8 A norm on a matriz ||Al| is @ matrix norm if, in addition 
to the four properties of a norm given above, it also satisfies the multiplicative 
property (also called consistency condition): 


| ABI < |All - Bl (A.95) 


Property (A.95) is very important when combining systems, and forms the 
basis for the small gain theorem. Note that there exist norms on matrices 
(thus satisfying the four properties of a norm), which are not matrix norms 
(thus not satisfying (A.95)). Such norms are sometimes called generalized 
matriz norms. The only generalized matrix norm considered in this book is 
the largest-element norm, || A||max- 

Let us first examine three norms which are direct extensions of the 
definitions of the vector p-norms. 

Sum matrix norm. This is the sum of the element magnitudes 


|Allsum => laij] — (I[Aollsum =1+2+3+4=10) (A.96) 
bj 
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Frobenius matrix norm (or Euclidean norm). This is the square root of 
the squared sum of the elements 


Alle = [37 lass? = ytr(4a") = yex(4¥ A) (\lAolle = V30 = 5.477) 


(A.97) 
Here tr is the sum of the the diagonal elements, and A” is the complex 
conjugate transpose of A. The Frobenius norm is important in control because 
it is used for summing up the channels, for example, when using LQG optimal 
control. 
Max element norm. This is the magnitude of the largest element 


[|Allmax = max|aij| —_([|Aollmax = 4) (A.98) 


This norm is not a matrix norm as it does not satisfy (A.95). However note 
that Vim ||Allmax is a matrix norm. 

The three above norms are sometimes called the 1—, 2— and co—norm, 
respectively, but this is not used in this book to avoid confusion with the 
more important induced p-norms introduced next. 


Induced matrix norms 
w z 


Figure A.2: Representation of (A.99) 


Induced matrix norms are important because of their close relationship to 
signal amplification in systems. Consider the following equation which is 
illustrated in Figure A.2 

z= Aw (A.99) 


where we may think of w as the input vector and z as the output vector. We 
want to consider the “amplification” or “gain” of the matrix A as defined by 
the ratio ||z||/||w||. The maximum gain for all possible input directions is of 
particular interest. This is given by the induced norm which is defined as 


A 
|| Allip & max Aw lp (A.100) 
w70 |[w|lp 
where ||wl|, = (32, |wi|?)!/? denotes the vector p-norm. In words, we are 


looking for a direction of the vector w such that the ratio ||z||p/||w|lp is 
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maximized. Thus, the induced norm gives the largest possible “amplifying 
power” of the matrix. The following equivalent definition is also used 


Alli = max ||Aw||, = max ||Aw A.101 
[Ally = max, Aullp = max, | Aull (A.101) 


For the induced 1-, 2- and co-norms the following identities hold: 


| Alli1 = max( > la,;|) “maximum column sum” (A.102 
gj. 
a 


|Allioo = max() > laij|) “maximum row sum” (A.103 
a 
j 


|Alli2 = o(A) = 1/p(A# A) “singular value or spectral norm” (A.104 


Here the spectral radius p(A) = max; |A;(A)] is the largest eigenvalue of the 
matrix A. Note that the induced 2-norm of a matrix is equal to the (largest 
singular value, and is often called the spectral norm. For the example matrix 
in (A.94) we get 


Aolla = 6;  ||Aollico = 7; ||Aollig = (Ao) = 5.117 (A.105) 


Theorem A.9 All induced norms ||Allip are matric norms and thus satisfy 
the multiplicative property 


IAB lip S [Allin - Blip (A.106) 


Figure A.3: Representation of (A.106). 


Proof: Consider the following set of equations which is illustrated graphically in 
Figure A.3. 


z=Av, v= Bw z= ABw (A.107) 


From the definition of the induced norm we get by first introducing v = Bw, then 
multiplying the numerator and denominator by ||v||p 4 0, and finally maximizing 
each term involving w and v independently, 


v 


——_N 
AB A B A B 
AB lp 2 max AB lle — ag Ole, WB saga ACI sane Bll 
w40 — |lwllp w#0 lolly — lwll» ~ v¥0° |lellp #0 |leolp 


and (A.106) follows from the definition of an induced norm. 
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Implications of multiplicative property 


For matrix norms the multiplicative property ||AB|| < ||A|| - ||B|] holds for 
matrices A and B of any dimension as long as the product AB exists. In 
particular, it holds if we choose A and B as vectors. From this observation we 
get: 

1. Choose B to be a vector, i.e B = w, and we derive from (A.95) for any 
matrix norm 


|| Awl] < |All: [ell (A.108) 


We say that the “matrix norm ||A|| is compatible with its corresponding vector 

norm ||w||”. Clearly, from (A.100) any induced matrix p-norm is compatible 

with its corresponding vector p-norm. Similarly, the Frobenius norm is 

compatible with the vector 2-norm (since when w is a vector ||w|| 7 = ||w|l2). 
2. From (A.108) we also get for any matrix norm 


A 
Ay Se A (A.109) 
w70 || 

Note that the induced norms are defined such that we have equality in the 
above equation. The property ||Al|7 > @(A) then follows since ||w||7 = ||wle. 
3. Choose both A = 2” and B = w as vectors. Then using the Frobenius 
norm or induced 2-norm (singular value) in (A.95) we derive the Cauchy- 

Schwarz inequality 
J2"w| < llzll2- llewll2 (A.110) 


where z and w are column vectors of the same dimension and z¥w is the 
Euclidean inner product between the vectors z and w. 

4. The inner product can also be used to define the angle ¢ between two 
vectors z and w 


@ = arccos GE) (A.111) 


I[2ll2° lle 


Note that with this definition ¢ is between 0° and 90°. 


A.5.3. The spectral radius 
The spectral radius p(A) is the largest eigenvalue of the matrix A. 


p(A) = max |A;(A)| (A.112) 


It is not a norm as it does not satisfy properties 2 and 4 of a norm. For 


example, for 
1 0 1 10 
aa [3 9). ae [h 9) tity 
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we have p(Ai) = 1, p(Ac) = 1. But p(Ai + Ag) = 12 is larger than 
p(A1) + p(Az) = 2 and is therefore inconsistent with the triangle inequality 
(property 4 of a norm). Also, it does not satisfy the multiplicative property 
(A.95). For example, (A; Az) = 101.99 which is larger than p(A1)p(A2) = 1. 

Although the spectral radius is not a norm, it provides a lower bound on 
any matrix norm which can be very useful: 


Theorem A.10 For any matrix norm (and in particular for any induced 
norm) 
p(A) < |All (A.114) 


Proof: Since \;(A) is an eigenvalue of A, we have that At; = \;t; where t; denotes 
the eigenvector. We get 


Aa] - [tall = [Astall = [Atal] < TAM [eel (A.115 


where the inequality follows from (A.108). Thus for any matrix norm |;(A)| < ||A| 
and since this must hold for all eigenvalues the result follows. 


For our example matrix in (A.94) we get p(Ag) = V10 & 3.162 which is 
less than all the induced norms (||Ao|li1 = 6, ||Aollico = 7,4(Ao) = 5.117 
and also less than the Frobenius norm (||A||~7 = 5.477) and the sum-norm 
(||Allsum = 10). 

A simple physical interpretation of (A.114) is that the eigenvalue measures 
the gain of the matrix only in certain directions (the eigenvectors), and must 
therefore be less than for an matrix norm which allows any direction and 
yields the maximum gain, recall (A.109). 


A.5.4 Some matrix norm relationships 


The various norms of the matrix A are closely related as seen from the 
following inequalities taken from Golub and van Loan (1989, p. 15) and Horn 
and Johnson (1985, p. 314): Let A be an ! x m matrix, then 


a(A) < ||Ally < /min(,m) a(A) (A.116 


\|Allmax < (A) < Vim || Allmax (A.117 
a(A) < V|AllalAllico (A.118 


1 
Taal Alico < (4) < Vi [Alico (A.119 


1 
Gillan < 914) < vm [lA (A.120 
max{@(A), ||Alles [|Allits|Allico} < ||Allsum (A121 
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All these norms are matrix norms and satisfy (A.95) except ||Allmax. The 
inequalities are tight, that is, there exist matrices of any size for which equality 
holds. Note from (A.117) that the singular value is quite closely related to the 
largest element of the matrix ||A||max, that is, one can usually just by looking 
at the matrix estimate the value of the largest singular value (this is not true 
for the other singular values, however). 

An important property of the Frobenius norm and the singular value 
(induced 2-norm) is that they are invariant with respect to unitary 
transformations, i.e., for unitary matrices Uj, satisfying U;Uf! = I, we have 


|U1 AU2||F = ||Alle (A.122) 
&(U, AU2) = &(A) (A.123) 


From an SVD of the matrix A = UNV” and (A.122) we then obtain an 
important relationship between the Frobenius norm and the singular values, 


I|Allr = [2 ai(A) (A.124) 


The Perron-Frobenius theorem applies to a square matrix A: 


min ||DAD™" la = min ||DAD™" liso = p((Al) (A.125) 


Here D is a diagonal “scaling” matrix, |A| denotes the matrix A with all its 
elements replaced by their magnitudes, and p(|A|) = max; ||A;(|Al)|| is the 
Perron root (Perron-Frobenius eigenvalue). The Perron root is greater or equal 
to the spectral radius, p(A) < p(|Al). 


A.5.5 Matrix and vector norms with MATLAB 


The following MATLAB commands are used for matrices: 
&(A) = ||Alji2 norm(A,2) or max(svd(A)) 

|All norm(A, 1) 

||Allico norm(A, “inf ~) 
||Allz7 norm(A, “fro’) 

||Allsum sum (sum(abs(A))) 

||Allmax max (max(abs(A))) (which is not a matrix norm) 
p(A) max(abs(eig(A))) 

p(|A]) max(eig(abs(A))) 

(A) = 4(A)/o(A)  conaca) 


For vectors: 
lall1 norm(a, 1) 
|all2_ norm(a, 2) 
|@||max nmorm(a, “inf’) 
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A.5.6 Signal norms 


We here consider the temporal norm of a time-varying (or frequency-varying) 
signal, e(t). In contrast with spatial norms (vector and matrix norms), we 
find that the choice of temporal norm makes a big difference. As an example, 
consider Figure A.4 which shows two signals, e1(t) and e2(t). For signal e; (t) 
the infinity-norm (peak) is one, ||e(t)||o. = 1, whereas since the signal does 
not “die out” the 2-norm is infinite, ||e(¢)||x2 = oo. For signal e2(t) the opposite 
is true. 


A e 


Figure A.4: Signals with entirely different 2-norms and oo-norms. 


A e 


Ilelloo 


Figure A.5: Signal 1-norm and oo-norm. 


For signals we may compute the norm in two steps: 


1. “Sum up” the channels at a given time or frequency using a vector norm 
(for a scalar signal we simply take the absolute value). 
2. “Sum up” in time or frequency using a temporal norm. 


Recall from above, that the vector norms are “equivalent” in the sense that 
they only differ by a constant factor depending on the size of the vector. 
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Therefore, it usually does not make too much difference which norm we use in 
step 1. We normally use the same p-norm both for the vector and the signal, 
and thus define the temporal p-norm, |le(¢) ||), of a time-varying vector as 


I, norm: (|le(t)||p)” = - a lex(7) |? dr (A.126) 


es 


lle(r) IIb 
The following temporal norms of signals are commonly used. 


Remark 1 Note that here ||e(7)||, denotes the vector p-norm of the vector e(¢) at 
a fixed value t = 7: 


ei(T) 


e2(T) 1/p 
e(r)= |. | teats (Slat?) (A.127) 


em(r) 


This should not be confused with the temporal norm |le(t)|| used in Step 2 where t¢ 
is not fixed. 


Remark 2 In most cases we assume e(t) = 0 for t < 0 so the lower value for the 
integration may be changed to 7 = 0. 


1-norm in time (integral absolute error (IAE), see Figure A.5): 
foe) 
le(hn= f Sletr)lar (A.128) 
or 
2-norm in time (quadratic norm, integral square error (ISE), “energy” of 


signal): 
lle(t)|l2 = / Dy lei(7) Pdr (A.129) 


oo-norm in time (peak value in time, see Figure A.5): 


lle(t) loo == max (max lei(7)1) (A.130) 


Remark. To be mathematically correct we should here have used sup, (least upper 
bound) rather than max-, since the maximum value may not actually be achieved 
(e.g. if it occurs for t = co). 


In addition, we will consider the power-norm or RMS-norm (which is actually 
only a seminorm since it does not satisfy property 2 of norms) 


: er i 
lle(¢)|lbow = tim op | Dole bear (A.131) 
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A.5.7 Signal interpretation of various system norms 


Two system norms are considered in Chapter 4.10. These are the Hz norm, 
|G(s)|l2 = ||g(d)||2 and the H.. norm, ||G(s) ||... The main reason for including 
the following section is to show that there exists many ways of evaluating 
performance in terms of signals, and to show how the Hz and Ho» norms 
are useful measures in this context. This in turn will be useful to help us 
understand how to select performance weights in controller design problems. 
The derivations of the results in this section require a good background in 
functional analysis and can be found in Doyle et al. (1992) and Zhou et al. 
(1996). 
Consider a system with input d and output e, 


e=Gd (A.132) 


For performance we may want the output signal e to be “small” for any allowed 
input signals d. We have to specify 


1. What the allowed d’s are. (What set does d belong to?) 
2. What we mean by “small”. (What norm should we use for e?) 


1. Some possible input signal sets are: 


1. d(t) consists of impulses, 6(t). (these generate step changes in the states, 
which is the usual way of introducing the LQ-objective and gives rise to 
the H2 norm.) 

2. d(t) is a white noise process with zero mean. 

3. d(t) = sin(wt) with fixed frequency is applied from t = —co (steady-state 

sinusoidal response) 

d(t) is a set of sinusoids, all frequencies allowed 

d(t) is bounded in energy, ||w(t)|l2 <1 

d(t) is bounded in power, ||d(¢)||pow < 1 

d(t) is bounded in magnitude, ||d(t) ||. <1 


Pou 


The three first inputs are specific signals, whereas the latter three are 
classes of inputs with bounded norm. Which of these input classes is the 
most reasonable, depends on the physical problem at hand. 

2. To measure the output signal one may consider the following norms: 


1-norm, ||e(t)|l1 

2-norm (energy), |le(t)|l2 

oo-norm (peak magnitude), ||e(t)||o0 
Power, lle(t) pow 


ai ate 


and also other norms are possible. Again, it is an engineering issue to decide 
which norm is the most appropriate. We will now consider which system norms 
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result from the definitions of input classes, and output norms, respectively. 
That is, we want to find the appropriate system gain to test for performance. 
The results for SISO systems where d(t) and e(t) are scalar signals are 
summarized in Tables A.1 and A.2. In these tables G(s) is the transfer 
function and g(t) is its corresponding impulse response. Note in particular 
that 


Hoo norm: ||G(s)|loo = maxay [RE (A.133) 


i, norm: ||9(t)|h1 = maxa) [Sit (A.134) 


We see from Tables A.1 and A.2 that the H2 and H. norms appear in 
numerous positions. This gives some basis for their popularity in control. In 
addition, the 7. norm results if we consider d(t) to be the set of sinusoids 
with all frequencies allowed, and measure the output using the 2-norm (not 
shown in Tables A.1 and A.2, but discussed in section 3.3.5). Also, the He 
norm results if the input is white noise and we measure the output using the 
2-norm. 


d(t) = 6(t) | d(t) = sin(at) 
llelle IG(s)lle__| oo (usually) 
llelloo [Nahe a(G(jw)) 
llell pow 0 557 (G (jw) 


Table A.1: System norms for two specific input signals and three different output 
norms 


IId|l2 IIdl]oo IIdllpow 
llell2__ |_IG(s)Iloo 00 oo (usually) 
llelloo || HG(s)Ie | Ig@Ih | oo (usually) 

llell pow 0 SNG(s)Mloo | NE )Iheo 


Table A.2: System gains for three sets of norm-bounded input signals and three 
different output norms. The entries along the diagonal are induced norms. 


The results in Tables A.1 and A.2 may be generalized to MIMO systems 
by use of the appropriate matrix and vector norms. In particular, the induced 
norms along the diagonal in Table A.2 generalize if we use for the Hy 
norm ||G(s)||. = max, 4(G(jw)) (as usual), and for the 1; norm we use 
llg(t)|l14. = max; ||g:(t) ||1, where g(t) denotes row i of the impulse response 
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matrix. The fact that the H,. norm and 1, norm are induced norms makes 
them well suited for robustness analysis, for example, using the small gain 
theorem. The two norms are also quite closely related as seen from the 
following bounds for a proper scalar system 


IIG(s)lloo < Ilg(H)lla < (2n + 1) |IG(s) [loo (A.135) 


Here n is the number of states in a minimal realization. A multivariable 
generalization for a strictly proper system is (Zhou et al., 1996) 


11 <[GUIiw < f° a(g(d)dt <2 > 05 <2M|IG(s)o (A136) 


where o; is the i’th Hankel singular value of G(s) and g(t) = Ce“'B is the 
impulse response matrix. 


A.6 Factorization of the sensitivity function 


Consider two plants G and G’. Typically, G is the nominal model and G"’ 
is some alternative model of the plant. Assume that the same controller is 
applied to both plants. Then the corresponding sensitivity functions are 


S=(1+GK)', S'=(1+G'K)" (A.137) 


A.6.1 Output perturbations 


Assume that G’ is related to G by either an output multiplicative perturbation 
Eo, or an inverse output multiplicative perturbation Ejo. Then S$’ can be 
factorized in terms of S as follows 


S'=S(I+EoT)'; G=(I+Eo)G (A.138) 
S' = SUI -—EjoS)"'(I- Eio); 1G’ = (I- Eio)'G4 (A.139) 
For a square plant, Eo and Ejo can be obtained from a given G and G’ by 
Fo =(@’-G)G"; Ei =(G'-Ga (A.140) 
Proof of (A.188): 


I4+G@K=1+(1+Eo)GK = (1+ EoGK(I+GK)~')(I+ GR) (A.141) 
ee—_._ 
T 
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Proof of (A.139): 
I+@K = I4+(I-—Eio)'GK = (I - Ejo)|((I — Eio) + GK) 
(I — Eio)7'(I — Eio (I+ GK)“) (1+ GK) (A.142) 
e—_._ —“——-’ 
S 


Closely related factorizations may be written in terms of the complimentary 
sensitivity (Horowitz and Shaked, 1975; Zames, 1981). For example, by writing 
(A.138) on the form S = S’(I+ EoT) and using the fact S—S’ = T’—T, we 
get 

T' —-T=S'EoT (A.143) 


A.6.2. Input perturbations 


For a square plant the following factorization in terms of the multiplicative 
uncertainty Ey at the input to the plant is useful 


S'=S(I+GE;G"'T)"' =SGU+E;T;)"'G'; G'=G(I+E;) (A.144 


Here T; = KG(I + KG)~! is the input complementary sensitivity. 


Proof: Substitute Eo = GE;G™' into (A.138) and use G7'T = T;G™". 


Alternatively, we may factor out the controller to get 


S' = (1+ TK E/K)S = K7(I1+T;E1) KS (A.145 


Proof: Start from [+ G' K = 1+G(I+ Er)K and factor out (I+ GK) to the left. 


A.6.3 Stability conditions 


The following Lemma then follows directly from the generalized Nyquist 
Theorem and the factorization (A.138): 


Lemma A.11 Assume that the negative feedback closed-loop system consist- 
ing of G(s)K(s) is stable. Suppose G' = (I+ Eo)G, and let the number of 
open loop unstable poles of G(s)K(s) and G'(s)K(s) be P and P’, respec- 
tively. Then the negative feedback closed-loop system consisting of G'(s)K(s) 
is stable if and only if 


N(det(I + EoT)) = P — P' (A.146) 


where N denotes the number clockwise encirclements of the origin as s 
traverses the Nyquist D-contour. 
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Proof: Let N(f) denote the number of clockwise encirclements of the origin by 
f(s) as s traverses the Nyquist D contour. For the encirclements of the product 
of two functions we have N(fif2) = N(fi) + N(fe)and (A.141) and the fact 
det(AB) = det A- det B yields 


N (det(I + G' K)) = N(det(I + EoT)) + N(det(I + GK)) (A.147) 


For stability we need from Theorem 4.14 that \V(det(J + G’K)) = —P’ and 
N (det(I + GK)) = —P and Lemma A.11 follows. The Lemma is from Hovd and 
Skogestad (1994a); similar results, at least for stable plants, have been presented by 
e.g. Grosdidier and Morari (1986) and Nwokah and Perez (1991)). 


In other words, (A.146) says that for stability det(J + EoT) must provide for 
the difference in the number of required encirclements between det (I + G’K 
and det(I + GK). If this is not the case then at least one of the systems 
consisting of GK or G’K must be unstable. We show in Theorem 6.9 that 
information about what happens at s = 0 may provide useful information 
about the number of encirclements. 


A.7 Linear Fractional Transformations 


Linear Fractional Transformations (LFTs) as they are presently used in 
the control literature were first introduced by Doyle (1984). These matrix 
functions are very powerful tools in system analysis and design. Consider a 
matrix P of dimension (n1 + n2) x (m1 + mz) and partition it as follows 


Pu Piz 
P= A.148 
EE Sa ( ) 


Let the matrices A € R™%*™ and K € C™*™ have compatible dimension 
with the upper and lower partitions of P, respectively. We adopt the following 
notation for the lower and upper linear fractional transformations 


F,(P,K) 2 Pu + PrK (I — PooK) Pm (A.149) 


F.(P, A) = Pog + Py AT — Py A) Pi. (A.150) 


where subscript | denotes lower and subscript u upper. In the following let R 
denote some transfer function resulting from an LFT. 

The lower fractional transformation F)(P,K) is the transfer function R 
resulting from wrapping (positive) feedback K around the lower part of P 
as illustrated in Figure A.6a. To see this note that the block diagram in 
Figure A.6 (a) may be written as 


z = Pyw+ Pou, UV = Pyw+ Pou, u= Kv (A.151) 
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Figure A.6: a) Ras lower LFT in terms of K. b) R as upper LFT in terms of A. 


Eliminating y and u from these equations yields 
z= Rw = FP, K)w = [Pu t+ PK (I — Pook) | Poijw (A.152) 


In words we say that “R is written as a lower LFT in terms of K”. Similarly, 
in Figure A.6 (b) we illustrate the upper LFT, R = F,(P, A), obtained by 
wrapping (positive) feedback A around the upper part of P. 


Figure A.7: Interconnection of LFTs yields a LFT. 


A.7.1 Interconnection of LFTs 


An important property of LFTs is that any interconnection of LFTs is again 
an LFT. Consider Figure A.7 where R is written in terms of a lower LFT of 
K', which again is a lower LFT of K, and we want to express R directly as 
an LFT of K. We have 


R=F(Q,K') where K! = F(M,K) (A.153) 
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and we want to obtain the P (in terms of Q and M) such that 


R=F(P,K) (A.154) 
We find 
Pu Pry 
P= = 
& Po 
Qu + Qi2Mii(T — Q22Mi1)~1Qa1 Qie(I — M11Q22) 1 Mie 
Mai(I = Q22Mi1)-1Qai Moa2 + M1 Q22 (I oa M11Q22) 1 Mie 
(A.155) 


Similar expressions apply when we use upper LFTs. For 
R=F,(M, A’) where A’ = F,(Q,A) (A.156) 
we get R= F,,(P, A) where P is given in terms of Q and M by (A.155). 


A.7.2 Relationship between fF; and F,,. 


F, and F,, are obviously closely related. If we know R = F,(M, K), then we 
may directly obtain R in terms of an upper transformation of K by reordering 
M. We have 


F,,(M, K) = F,\(M, K) (A.157) 
= a= [9 Z]ar(? 7 cas) 


A.7.3 Inverse of LFT 
On the assumption that all the relevant inverses exist we have 

(A(M, K))~' = (M, K) (A159) 
where M is given by 


yo M;} —Mz' Mi» 


M= = 2 A.160 
Mu M,' Mo2 = Mx Mj; Mie ( ) 


This expression follows easily from the matrix inversion lemma in (A.5). 


A.7.4. LEFT in terms of inverse 


Given an LFT in terms of K, it is possible to derive an equivalent LFT in 
terms of K~'. If we assume that all the relevant inverses exist we have 


Fi(M, K) = Fi(M, K~) (A.161) 
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where M is given by 


vee Mi — Miy2M55' May —Mj2M3,' 


M= ie a A.162 
Myy' Max M33 ( ) 


This expression follows from the fact that (I+ L)~! = I— L(I+ L)~! for any 
square matrix LD. 


A.7.5 Generalized LFT: The matrix star product 


ny Nu 


Figure A.8: Star product of Q and M, R=S(Q,M). 


A generalization of the upper and lower LFTs above is provided by Redheffer’s 
star product. Consider Figure A.8 where Q and M are interconnected such 
that the last n,, outputs from Q are the first n,, inputs of M, and the first n, 
outputs from M are the last n; inputs of Q. The correspondingly partitioned 


matrices are 
Qu Qi My Mi 
— M = 
@ ie Q22 |’ Ma, Mo 


The overall matrix R with these interconnections closed (see Figure A.8) is 
called the star product, S(Q, M), between Q and M. We find that 


R=S(Q,M)= 
Qu + QeMii(l — Q22M11)~'Qar Qie(I — M11Q22) 1 Mis 
Mai (I = Q22Mi1)~1Qai Mo. + Ma1Q22(L = M11Q22) 1 Mie 


(A.163) 


Note that S(Q, /) depends on the chosen partitioning of the matrices Q and 
M. If one of the matrices is not partitioned then this means that this matrix 


MATRIX THEORY AND NORMS 567 


has no external inputs and outputs, and S(Q, M) then gives the “maximum” 
interconnection. For example, we have for the LFTs 


Fi(P, K) = S(P,K) (A.164) 


F,(P, A) = S(A, P) (A.165) 


(the order in the last equation is not a misprint). Of course, this assumes that 
the dimensions of K and A are smaller than that of P. The corresponding 
command to (A.163) in the MATLAB Mu-Toolbox is 


starp(Q,M,nu,n1) 


where n,, and n; are as shown in Figure A.8. If ny, and n; are not specified 
then this results in a “maximum” interconnection involving the corresponding 
LFT in (4.164) or (4.165). 
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APPENDIX B 


PROJECT WORK and EXAM 
PROBLEM 


B.1 Project work 


In many cases the students formulate their own project based on some 
application they are working on. In other cases the project is given by the 
instructor. In either case, a preliminary statement of the problem must be 
approved before starting the project, see the first item below. 


1. Preliminary problem definition (Introduction). State the problem: 
i) Explain in words and with a simple figure what the problem is about. 


ii) Discuss briefly what the control objective is (what output signal do 
you want to keep small?). 


iii) Specify exogenous inputs (disturbances, noise, setpoints), manipulated 
inputs, measurements, and controlled outputs (exogenous outputs). 


iv) Describe the most important sources of model uncertainty. 


v) What control problems are you expecting (Interactions, RHP-zeros, 
etc.)? 


The preliminary statement (1-2 pages) must be handed and approved 
before starting the project. 

2. Plant model. Specify parameters, etc. and obtain a linear model of your 
plant. Comment: You may want to consider more than one operating point. 

3. Analysis of the plant. For example, compute steady-state gain matrix or 
plot gain elements as a function of frequency, obtain poles and zeros (both 
of individual elements and overall system), compute SVD and comment 
on directions and condition number, perform RGA-analysis, disturbance 
analysis, etc.. Does the plant seem to be difficult to control? 

4. Initial controller design. Design at least two controllers, for example, 


(i) Decentralized (PID). 
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(ii) ”Full” (Using LQG, LTR, Ho (in principle same as LQG, but different 
way of choosing weights), H. loop shaping, Hoo, etc.). 


(iii) Decoupler combined with PI. 


5. Simulations. Perform some simulations in the time domain for the 
closedloop system. 
6. Robustness analysis using pu. 


(a) Choose reasonable performance and uncertainty weights. Plot all 
weights as a function of frequency. 


(b) State clearly how RP is defined for your problem (block diagram). 
(c) Compute ys for NP, RS, and RP. 


(d) Perform some sensitivity analysis. For example, change weights 
(e.g., make one output channel faster and another slower), move 
uncertainties around (e.g., from input to output), change A’s from 
a diagonal to full matrix, etc. 


Comment: You may need to back to step (a) and redefine your weights if 
you find out from step (c) that your original weights are unreasonable. 

7. Optional: Ho or p-optimal controller design. Design a Ho. or p-optimal 
controller and see if you can improve the response and satisfy RP. Compare 
simulations with previous designs. 

8. Discussion. Discuss results. Also comment on how things could have been 
changed such that you learned more from the project. 

9. Conclusion. 


B.2 Sample exam problem 


A 5-hour exam. 


Problem 1 (35%). Controllability analysis. 

Perform a controllability analysis (compute poles, zeros, RGA11(s), check 
for constraints, discuss the use of decentralized control (pairings), etc.) for 
the following four plants. You can assume that the plants have been scaled 
properly. 

a) 2 x 2 plant: 


1 s—1 1 


Gl) = By as —11) | 90 10(s—1) (B.1) 
b) SISO plant with disturbance: 
—0.1ls+1 40 
g(s) = 200 e : gals) = (B.2) 


(s + 10)(0.25 +1)’ stl 
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c) Plant with two inputs and one output: 


8 4 3 


WS) = 95549" + Oosa1%2 + O0nsF 1! 


(B.3) 


d) This 2 x 2 plant with 1 disturbance is given on state-space form. i) Find a 
block diagram representation with each block on the form k/1+7s. ii) Perform 
a controllability analysis. 


Ly = —0.121 ate 0.01u4 


Le = —0.522 a 102 
bs = 0.2521 + 0.2529 — 0.2523 + 1.25d 


Problem 2 (25%). General Control Problem Formulation. 


ACID 


pH #& 13 


Figure B.1: Neutralization process 


Consider the neutralization process in Figure B.1 where acid is added in 
two stages. Most of the neutralization takes place in tank 1 where a large 
amount of acid is used (input u,) to get pH about 10 (measurement y;). In 
tank 2 the pH is fine-tuned to about 7 (output y2) by using a small amount 
of acid (input uz). (This is just to give you some idea of a real process; all the 
information you need to solve the problem is given below). 

A block diagram of the process is shown in Figure B.2. There are one 
disturbance, two inputs and two measurements (y; and yo). The main control 
objective is to keep yo & rg. In addition we would like to reset input 2 to its 
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Figure B.3: General control configuration 


nominal value, that is, we want at low frequencies ug ¥ r,,,. Note that there 
is no particular control objective for y,. 

a) Define the general control problem, that is, find z, w, u, v and P (see 
Figure B.3). 

b) Define an Hoo-problem based on P. Discuss briefly how you want the 
unweighted transfer functions from d to z to be, and use this to say a little 
about how the performance weights should be. 


Figure B.4: Proposed control structure for neutralization process 


c) A simple practical solution based on single loops is shown in 
Figure B.4. Explain briefly the idea behind this control structure, and 
find the interconnection matrix P with the generalized controller K = 
diag{k,, hk, k3} (Note: u and y are different in this case, while w and 
z are the same as in a)). 


Problem 3 (40%). Various. 


Briefly answer the following questions: 
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a) Consider the plant 
£(t) =a(1+1.5d,)a(t) + 0(1+0.2d,)u(t); y= 


Here |d,| < 1 and |6,| < 1. For the feedback controller K(s) derive the 
interconnection matrix M for robust stability. 

b) For the above case assume you use the condition minp¢(DMD~“) <1 
to check for robust stability (RS). What is D (give as few parameters as 
possible)? Is this RS-condition tight in this case? 

c) When is the condition p(MA) < 1 necessary and sufficient for robust 
stability? Based on p(MA) < 1 derive the RS-condition (M/Z) < 1. When is 
it necessary and sufficient? 

d) Let 


G,(s) = gitwiAr giz +weAe 


, [Ai] <1,|As| <1 
gar + w3Ay 922 [41] $1,149] s 


Represent this uncertainty as Gp = G+ W,AW2 where A is diagonal. 
Represent this on the M A-structure and derive the RS-condition. 
e) Let 
1— 4s 

p(s) = Tos 
and consider the controller K(s) = c/s. Represent this on the MA-structure 
and find the RS-condition. 
g) Show by a counterexample that in general o(AB) is not equal to (BA). 
Under what conditions is u(AB) = p(B.A)? 
g) The PRGA matrix is defined as T = GaiagG'. What is its relationship to 
RGA? Derive a performance condition for decentralized control involving the 


PRGA. 


6=O(1+wA), |Al<1 
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phase (wis0), 34, 41 


D-stability, 469 

Decay ratio, 31 

Decentralized control, 89, 255, 441, 

460-478 

application: distillation process, 474 
CLDG, 471 
controllability analysis, 472 
D-stability, 469 
input uncertainty (RGA), 255 
interaction, 461 
pairing, 460, 463, 466, 468 
performance, 470 
PRGA, 461, 471 
RDG, 472 
RGA, 462-470 
sequential design, 450, 452, 477 
stability, 463 
triangular plant, 465 


why use, 448 
Decentralized Integral Controllability 
(DIC), 467 
determinant condition, 469 
RGA, 468 


Decibel (dB), 19 
Decoupling, 87-88 
dynamic, 87 
partial, 88 
steady-state, 87 
Decoupling element, 441 
Derivative action, 131, 206 
Descriptor system, 402 
Detectable, 137 
Determinant, 533 
Deviation variable, 5, 9 
Diagonal controller, see Decentralized 
control 
Diagonal dominance, 96, 464 
Differential sensitivity, 273 
Direction of plant, 76, see also Output 
direction 
Discrete time control 
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Heo loop shaping, 418 
Distillation process, 101, 523-529 
DV-model, 528 
diagonal controller, 332 
inverse-based controller, 258 
robust stability, 332 
sensitivity peak, 258 
LV-model, 525-528 
CDC benchmark problem, 526 
coupling between elements, 311 
decentralized control, 474 
detailed model, 527 
disturbance rejection, 247 
DkK-iteration, 358 
element-by-element uncertainty, 262 
feedforward control, 257, 458 
Heo loop-shaping, 104 
inverse-based controller, 101, 104, 
257, 350 
p-optimal controller, 358 
partial control, 458 
physics and direction, 82 
robust performance, 350 
robustness problem, 101, 257 
sensitivity peak (RGA), 257 
SVD-analysis, 82 
SVD-controller, 104 
Disturbance (d), 13 
limitation MIMO, 241-243 
limitation SISO, 199-201 
scaling, 6 
Disturbance model (Ga), 126, 153 
internal stability, 153 
Disturbance process example, 50 
Heo loop shaping, 404 
inverse-based controller, 51 
loop-shaping design, 53 
mixed sensitivity, 64 
two degrees-of-freedom design, 56 
Disturbance rejection, 51 
MIMO system, 89 
mixed sensitivity, 507 
DkK-iteration, 356 
Dynamic resilience, 173 


Eigenvalue (A), 77, 534 
measure of gain, 77 
pole, 138 
properties of, 535 
spectral radius, see Spectral radius 
state matrix (A), 536 


transfer function, 537 
Higenvector, 534 
left, 534 
right, 534 
Element uncertainty, 260, 545 
RGA, 260 
Estimator 
general control configuration, 114 
, see also Observer 
Euclidean norm, 552 
Excess variation, 32 
Exogenous input (w), 13 
Exogenous output (z), 13 
Extra input, 445 
Extra measurement, 443 


Fan’s theorem, 540 
FCC process, 267 
controllability analysis, 267, 457 
RGA-matrix, 98 
RHP-zeros, 267 
Feedback 
negative, 24, 71 
positive, 71 
why use, 26 
Feedback rule, 71 
Feedforward control, 26 
distillation process, 458 
perfect, 26 
Feedforward element, 441 
Fictitious disturbance, 26 
Flexible structure, 57 
Fourier transform, 126 
Frequency response, 17—23, 126 
bandwidth, see Bandwidth 
break frequency, 20 
gain crossover frequency (w-), 35, 40 
magnitude, 18, 19 
MIMO system, 74 
phase, 18 
phase crossover frequency (wiso), 34, 
41 
phase shift, 19 
physical interpretation, 17 
straight-line approximation, 22 
Frobenius norm, 552 
Full-authority controller, 504 
Functional controllability, 233, 233 
and zeros, 147 
uncontrollable output direction, 234 


Gain, 19, 76 
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Gain Margin (GM), 34, 37, 291 
LQG, 381 
Gain reduction margin, 35 
LQG, 381 
Gain scheduling 
Heo loop shaping, 416 
General control configuration, 106, 386 
including weight, 109 
Generalized controller, 106 
Generalized eigenvalue problem, 142 
Generalized inverse, 541 
Generalized plant, 13, 106, 112, 386 
Heo loop shaping, 411, 416 
estimator, 114 
feedforward control, 112 
input uncertainty, 319 
limitation, 116 
mixed sensitivity (S/KS), 393 
mixed sensitivity (S/T), 396 
one degree-of-freedom controller, 107 
two degrees-of-freedom controller, 112 
uncertainty, 309 
Gershgorin’s theorem, 464, 536 
Glover-McFarlane loop shaping, see 
Hoo loop shaping 
Gramian matrix, 132, 136, 482, 485 


He norm, 60, 163, 560 
computation of, 164 
stochastic interpretation, 388 

He optimal control, 386, 388-389 
assumptions, 386 
LQG control, 389 

Hoo loop shaping, 58, 399-420 
aero-engine, 519 
anti-windup, 419 
bumpless transfer, 419 
controller implementation, 408 
controller order, 490 
design procedure, 404 
discrete time control, 418 
gain scheduling, 416 
generalized plant, 411, 416 
implementation, 418 
MATLAB, 405 
observer, 415 
servo problem, 409, 414 
two degrees-of-freedom controller, 409— 

414 
weight selection, 520 
Hoo norm, 60, 164, 560 
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Hardy space, 60 
induced 2-norm, 164 
MIMO system, 85 
multiplicative property, 166 
relationship to H2 norm, 165 
Hoo optimal control, 386, 390-399 
y-iteration, 392 
assumptions, 386 
mixed sensitivity, 392, 506 
robust performance, 399 
signal-based, 396 
Hadamard-weighted H. problem, 116 
Hankel norm, 167-168, 401, 482, 484 
model reduction, 168, 484—486 
Hankel singular value, 167, 482, 483, 
487, 561 
aero-engine, 518 
Hanus form, 419 
Helicopter case study, 502-512 
Hidden mode, 137 
Hierarchical control, 450-460 
5 x 5 distillation process, 453 
cascade control, 456 
extra measurement, 456 
partial control, 450, 456 
sequential design, 452 
Hurwitz, 138 


Ideal resting value, 445 

Identification, 261 
sensitivity to uncertainty, 262 

Ill-conditioned, 94 

Improper, 5 

Impulse function, 125 

Impulse response, 32 

Impulse response matrix, 125 

Indirect control, 433, 451 
partial control, 456 

Induced norm, 552 
maximum column sum, 553 
maximum row sum, 553 
multiplicative property, 553 
singular value, 553 
spectral norm, 553 

Inferential control, 434 

Inner product, 554 

Input constraint, 201, 419 
acceptable control, 202, 245 
anti-windup, 419 
distillation process, 247 
limitation MIMO, 244-249 
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limitation SISO, 201-205 
max-norm, 244 
perfect control, 202, 244 
two-norm, 244 
unstable plant, 203 
Input direction, 80 
Input resetting, 443 
Input selection, 435 
Input uncertainty, 100, 250, 260 
condition number, 260 
diagonal, 100, 103 
generalized plant, 319 
magnitude of, 319 
, see also Uncertainty 
minimized condition number, 260 
RGA, 260 
Input, manipulated, 13 
scaling, 6 
Input-output controllability, 170 
analysis of, 171 
application 
aero-engine, 512-523 
FCC process, 98, 267, 457 
first-order delay process, 214 
neutralization process, 217 
room heating, 216 
condition number, 94 
controllability rule, 209 
decentralized control, 472 
exercises, 266 
feedforward control, 212 
plant design change, 170, 264 
plant inversion, 173 
remarks definition, 172 
RGA analysis, 95 
scaling MIMO, 228 
scaling SISO, 172 
summary: MIMO, 262-265 
summary: SISO, 209-212 
Input /output selection, 424 
Integral absolute error (IAE), 558 
Integral action, 30 
Integral control 
uncertainty, 261 
, see also Decentralized Integral 
Controllability 
Integral square error (ISE), 33 
optimal control, 235 
Integrator, 158 
Integrity, 466 
determinant condition, 469 


, see also Decentralized Integral 
Controllability 
Interaction, 69, 81 
two-way, 96 
Internal model control (IMC), 50, 53, 
88, 154 
Internal model principle, 53 
Internal stability, 137, 148-153 
disturbance model, 153 
feedback system, 150 
interpolation constraint, 151 
two degrees-of-freedom controller, 152 
Interpolation constraint, 151, 229 
MIMO, 229 
RHP-pole, 230 
RHP-zero, 229 
SISO, 180 
Inverse response, 185 
Inverse response process, 27, 47 
loop-shaping design, 47 
LQG design, 379 
P control, 28 
PI control, 30 
Inverse-based controller, 49, 51, 87, 102 
input uncertainty and RGA, 255 
robust performance, 354 
structured singular value (1), 354 
worst-case uncertainty, 256 
ISE optimal control, 183 


Jordan form, 131, 480, 481 


Kalman filter, 115, 377 
generalized plant, 114 
robustness, 382 


1, norm, 560 
Laplace transform, 46, 125 
final value theorem, 46 
Least squares solution, 541 
Left Half Plane (LHP)-zero, 193 
Linear Fractional Transformation (LFT), 
111, 117, 118, 120, 563-567 
factorization of S, 120 
interconnection, 564 
inverse, 565 
MATLAB, 567 
stabilizing controller, 120 
star product, 566 
uncertainty, 305 
Linear matrix inequality (LMI), 368 
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Linear model, 8 
Linear Quadratic Gaussian, see LQG 
Linear Quadratic Regulator (LQR), 376 
cheap control, 235 
robustness, 381 
Linearization, 9 
Local feedback, 201, 221, 222 
Loop shaping, 43, 45, 371-374 
desired loop shape, 46, 52, 89 
disturbance rejection, 51 
flexible structure, 57 
Robust Performance, 296 
slope, 46 
trade-off, 44 
, see also Heo loop shaping 
Loop transfer function, 24 
Loop transfer recovery (LTR), 374, 384— 
385 
LQG control, 26, 43, 375-383 
H2 optimal control, 389 
controller, 377 
inverse response process, 379 
MATLAB, 379 
problem definition, 376 
robustness, 381, 382 
Lyapunov equation, 133, 136, 482 


Main loop theorem, 344 
Manipulated input, see Input 
MATLAB files 

coprime uncertainty, 403, 405 

LQG design, 379 

matrix norm, 556 

mixed sensitivity, 65 

model reduction, 497 

RGA, 548 

vector norm, 556 
Matrix, 124 

exponential function, 124 

inverse, 532 
Matrix inversion lemma, 532 
Matrix norm, 78, 551 

Frobenius norm, 552 

induced norm, 552 

inequality, 555 

MATLAB, 556 

max element norm, 552 

relationship between norms, 555 
Maximum modulus principle, 181 
Maximum singular value, 80 
McMillan degree, 137, 479 
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MeMillan form, 145 
Measurement, 13 
cascade control, 456 
Measurement selection, 433 
distillation column, 434 
MIMO system, 69 
Minimal realization, 137 
Minimized condition number, 543, 545 
input uncertainty, 260 
Minimum phase, 21 
Minimum singular value, 80, 263 
aero-engine, 516 
output selection, 432 
plant, 233, 245 
Minor of a matrix, 139 
Mixed sensitivity, 63, 296 
disturbance rejection, 507 
general control configuration, 109 
generalized plant, 110 
Heo optimal control, 392, 506 
RP, 296 
weight selection, 506 
Mixed sensitivity (S/KS), 64 
disturbance process, 64 
generalized plant, 393 
MATLAB, 65 
MIMO plant with RHP-zero, 92 
MIMO weight selection, 90 
Mixed sensitivity (S/T) 
generalized plant, 396 
Modal Truncation, 480 
Model, 13 
derivation of, 8 
scaling, 7 
Model matching, 412, 491 
Model predictive control, 44 
Model reduction, 479-499 
aero-engine model, 487 
balanced residualization, 483 
balanced truncation, 483 
coprime, 496 
error bound, 486, 497 
frequency weight, 498 
Hankel norm approximation, 168, 
484-486 
MATLAB, 497 
modal truncation, 480 
residualization, 481 
steady-state gain preservation, 488 
truncation, 480 
unstable plant, 496-497 
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Model uncertainty, see Uncertainty 
Moore-Penrose inverse, 542 

Lu, see Structured singular value 
p-synthesis, 356-365 

Multilayer, 428 

Multilevel, 428 

Multiplicative property, 78, 166, 553 
Multivariable stability margin, 334 
Multivariable zero, see Zero 


Neutralization process, 217-222, 571 
control system design, 221 
mixing tank, 218 
plant design change 

multiple pH adjustments, 221 
multiple tanks, 219 

Niederlinski index, 469 

Noise (n), 13 

Nominal Performance (NP), 3, 293, 322 
Nyquist plot, 293 

Nominal Stability (NS), 3, 322 

Non-causal controller, 193 

Non-minimum phase, 22 

Norm, 549-561 

, see also Matrix norm 
, see also Signal norm 
, see also System norm 
, see also Vector norm 

Normal rank, 233 

Notation, 11 

Nyquist D-contour, 158 

Nyquist array, 86 

Nyquist plot, 19, 34 

Nyquist stability theorem, 158 

argument principle, 160 

generalized, MIMO, 158 

SISO, 28 


Observability Gramian, 136, 482 
Observability matrix, 136 
Observer, 415 
Hoo loop shaping, 415 
One degree-of-freedom controller, 24 
Optimization, 427 
closed-loop implementation, 429 
open-loop implementation, 429 
Optimization layer, 425 
look-up table, 433 
Orthogonal, 79 
Orthonormal, 79 
Output (y), 13 


primary, 13, 447 
secondary, 13, 447 
Output direction, 79, 227, 228 
disturbance, 227, 242 
plant, 79, 227 
pole, 144, 227 
zero, 143, 227 
Output scaling, 6 
Output uncertainty, see Uncertainty 
Overshoot, 31 


Padé approximation, 132, 193 
Pairing, 460, 463, 466, 468 
aero-engine, 520 
, see also Decentralized control 
Parseval’s Theorem, 388 
Partial control, 450 
“true”, 451, 456 
distillation process, 458 
FCC process, 267 
Partitioned matrix, 533, 534 
Perfect control, 173 
non-causal controller, 193, 194 
unstable controller, 194 
Performance, 31, 169, 227 
Hoonorm, 85 
frequency domain, 34 
limitations MIMO, 227 
limitations, SISO, 169 
time domain, 31 
weight selection, 61 
weighted sensitivity, 61, 85 
worst-case, 347, 364 
, see also Robust performance 
Performance Relative Gain Array (PRGA), 
461, 475 
Perron root (p(| A |)), 465, 556 
Perron-Frobenius theorem, 556 
Perturbation, 323 
allowed, 323 
destabilizing, 326 
, see also Real perturbation 
, see also Uncertainty 
Phase lag 
limitation SISO, 205 
Phase Margin (PM), 35, 37 
LQG, 381 
Phasor notation, 20 
PI-controller, 30 
Ziegler-Nichols tuning rule, 30 
PID-controller, 131 
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cascade form, 206 
ideal form, 131 
Pinned zero, 146 
Plant (G), 13 
, see also Generalized plant (P) 
Plant design change, 170, 219, 264 
neutralization process, 219, 221 
Pole, 138, 138-141 
effect of feedback, 146, 147 
stability, 138 
, see also RHP-pole 
Pole direction, 144 
from eigenvector, 144 
Pole polynomial, 139 
Polynomial system matrix, 142 
Post-compensator, 88 
Power spectral density, 375, 384 
Pre-compensator, 86 
Principal component regression, 542 
Principal gain, 79 
, see also Singular value 
Process noise, 375 
Proper, 5 
Pseudo-inverse, 541 


Q-parameterization, 153 


Rank, 538 
normal rank, 233 
Rate feedback, 505 
Real perturbation, 366 
DG K- iteration, 366 
Lt, 336, 366 
robust stability, 323 
Reference (1), 13, 429 
optimal value, 429 
performance requirement MIMO, 247 
performance requirement SISO, 199- 
201 
scaling, 6, 7 
Regulator problem, 3 
Regulatory control, 425 
Relative disturbance gain (RDG), 472 
Relative Gain Array (RGA, A), 95, 544 
aero-engine, 517 
controllability analysis, 95 
decentralized control, 97, 462—470 
diagonal input uncertainty, 96 
DIC, 468 
element uncertainty, 96 
element-by-element uncertainty, 260 
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input uncertainty, 255, 260 
input-output selection, 437 
MATLAB, 548 
measure of interaction, 463 
non-square, 96, 547 
properties of, 545 
RGA-number, 96, 472, 518 
RHP-zero, 470 
steady-state, 520 
Relative order, 206 
Return difference, 157 
factorization, 461, 562 
RHP-pole, 14, 28, 195, 230, 239 
limitation MIMO, 230, 239 
limitation SISO, 195 
RHP-pole and RHP-zero 
MIMO, 239 
angle between pole and zero, 232 
sensitivity peak, 231 
SISO, 197 
Hoo design, 197 
stabilization, 197 
RHP-zero, 1/4, 22, 49, 185, 235 
aero-engine, 517 
bandwidth limitation, 186 
complex pair, 186 
decoupled response, 237 
FCC process, 267 
high-gain instability, 185 
interaction, 238 
inverse response, 185 
limitation MIMO, 235 
limitation SISO, 49, 185 
low or high frequency, 190 
move effect of, 93, 236 
multivariable, 91 
perfect control, 193, 194 
phase lag, 22 
positive feedback, 191 
RGA, 470 
weighted sensitivity, 180, 188, 230 
performance at high frequency, 190 
performance at low frequency, 188 
Riccati equation, 128 
H.. optimal control, 391 
Heo loop shaping, 417 
controller, 391 
coprime uncertainty, 402 
Kalman filter, 377 
state feedback, 376 
Right half plane (RHP), 14 
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Right Half Plane-pole, see RHP-pole 
Right Half Plane-zero, see RHP-zero 
Rise time, 31 
Robust Performance (RP), 3, 269, 293, 
322, 342 
bh, 342 
H..optimal control, 399 
condition number, 353, 355 
distillation process, 350 
graphical derivation, 294 
input uncertainty, 347-356 
inverse-based controller, 354 
loop-shaping, 296 
mixed sensitivity, 296 
Nyquist plot, 295 
output uncertainty, 355 
relationship to robust stability, 345 
relationship to RS, 299 
SISO, 293, 299 
structured singular value, 296 
worst-case, 347 
Robust Stability (RS), 3, 269, 286, 322, 
323, 331 
MA-structure, 310, 323 
complementary sensitivity, 288 
coprime uncertainty, 327, 400 
destabilizing perturbation, 326 
determinant condition, 323 
gain margin, 291 
graphical derivation, 287 
input uncertainty, 327, 332 
inverse multiplicative uncertainty, 292, 
327 
multiplicative uncertainty, 286 
Nyquist plot, 287 
real perturbation, 323 
relationship to RP, 299 
scaling, 330 
sensitivity, 293 
SISO, 286 
skewed- 1, 336 
small gain theorem, 330 
spectral radius condition, 324 
spinning satellite, 333 
structured singular value (1), 331 
unstructured uncertainty, 325, 326 
Robustness, 99, 105 
Hoonorm, 105 
LQG control, 381 
LTR, 384 
motivating examples, 99 


Roll-off rate, 45 

Room heating process 
controllability analysis, 216 
deriving model, 9 


Scaling, 6-8, 172, 228, 406 
aero-engine, 515 
MIMO controllability analysis, 228 
SISO controllability analysis, 172 
Schur complement, 533 
Schur product, 544 
Schur’s formula, 534 
Second-order system, 38 
Secondary output, 443 
Selector 
auctioneering, 448 
override, 448 
Self-regulation, 199, 212 
Semi-proper, 5 
Seminorm, 549 
Sensitivity function (S), 24-25, 72 
bandwidth (wp), 40 
factorization, 120, 561 
output (So), 72 
, see also Mixed sensitivity 
, see also Weighted sensitivity 
Sensitivity function peak (|| S ||.0), 181, 
231 
MIMO RHP-pole and RHP-zero, 232 
MIMO RHP-zero, 230 
SISO peak(M, Ms), 36 
SISO RHP-pole and RHP-zero, 182 
SISO RHP-zero, 181 
uncertainty, 253-260 
Separation Theorem, 376, 378 
Sequential design, 452, 477 
Servo problem, 3, 378 
Heo loop shaping, 409 
LQG, 378 
non-causal controller, 193 
Setpoint, see Reference (r) 
Settling time, 31 
Shaped plant (G';), 86, 404 
Shaping of closed-loop transfer func- 
tion, 43, see also Loop shaping 
Sign of plant MIMO, 261 
Signal norm, 557 
oo-norm, 558 
lp norm, 558 
1-norm, 558 
2-norm, 558 
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ISE, 558 
power-norm, 558 
Singular approximation, 483 
Singular matrix, 538, 540 
Singular value, 79, 80 
2 x 2 matrix, 538 
Hoonorm, 85 
frequency plot, 83 
inequalities, 539 
Singular value decomposition (SVD), 
79, 537 
2 x 2 matrix, 79 
economy size, 541 
nonsquare plant, 83 
of inverse, 539 
pseudo-inverse, 541 
SVD-controller, 88 
Singular vector, 79, 537 
Sinusoid, 18 
Skewed-p, 336, 347, 355 
Small gain theorem, 162 
robust stability, 330 
Spatial norm, 549 
, see also Matrix norm 
, see also Vector norm 
Spectral radius (p), 535, 554 
Perron root (p(| A |)), 556 
Spectral radius stability condition, 161 
Spinning satellite, 99 
robust stability, 333 
Split-range control, 448 
Stability, 27, 137, 138 
closed-loop, 27 
frequency domain, 155 
internal, 137 
, see also Robust stability 
Stability margin, 36 
coprime uncertainty, 401 
multivariable, 334 
Stabilizable, 137, 197 
strongly stabilizable, 197 
unstable controller, 241 
Stabilizing controller, 120, 153-155 
Star product, 320, 566 
State controllability, 182, 173 
example: tanks in series, 134 
State matrix (A), 124 
State observability, 136 
example: tanks in series, 136 
State-space realization, 123, 129 
hidden mode, 137 
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inversion of, 129 
minimal (McMillan degree), 137 
unstable hidden mode, 137 
, see also Canonical form 
Steady-state offset, 30, 31 
Step response, 32 
Strictly proper, 5 
Structural property, 233 
Structured singular value (u, SSV), 296, 
331, 334 
p-synthesis, 356-365 
complex perturbations, 337 
computational complexity, 366 
discrete case, 367 
DkK- iteration, 356 
distillation process, 358 
interaction measure, 464 
LMI, 368 
nominal performance, 346 
practical use, 369 
properties of, 334 
complex perturbation, 337-342 
real perturbation, 336 
real perturbation, 366 
relation to condition number, 353 
remarks definition, 334 
robust performance, 342, 346, 399 
robust stability, 346 
RP, 296 
scalar, 335 
skewed-, 296, 336, 347 
state-space test, 367 
upper bound, 365 
worst-case performance, 347 
Submatrix (A“’), 532 
Sum norm (|| A ||sum), 551 
Superposition principle, 5, 123 
Supervisory control, 425 
Supremum, 60 
System norm, 162-168, 559 
System type, 46 


Temporal norm, 549 
, see also Signal norm 
, see also System norm 
Time delay, 49, 132, 184, 234 
increased delay, 234 
limitation MIMO, 234 
limitation SISO, 49, 184 
perfect control, 193 
phase lag, 22 
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Time delay uncertainty, 36 
Time response 
decay ratio, 31 
excess variation, 32 
overshoot, 31 
quality, 32 
rise time, 31 
settling time, 31 
speed, 31 
steady-state offset, 31 
total variation, 32 
Total variation, 32 
Transfer function, 3, 24, 125 
closed-loop, 24 
evaluation MIMO, 71 
evaluation SISO, 25 
rational, 5 
state-space realization, 130 
Transmission zero, see Zero, 145 
Transpose (A7), 532 
Triangle inequality, 78, 549 
Truncation, 480 
Two degrees-of-freedom controller, 11, 
152 
Heo loop shaping, 409-414 
design, 55-56 
internal stability, 152 
local design, 114, 440 


Uncertainty, 26, 207, 269, 309, 311 

NA-structure, 312 

additive, 277, 279, 313 

and feedback — benefits, 251 

and feedback — problems, 253 

at crossover, 208 

chemical reactor, 305 

complex SISO, 276-281 

Convex set, 323 

coprime factor, 328, 400 

diagonal, 317 

element-by-element, 311, 316 

feedforward control, 207, 250 

distillation process, 257 
RGA, 251 

frequency domain, 275 

generalized plant, 309 

infinite order, 286 

input, 313, 315, 319, see also Input 
uncertainty 

input and output, 321 

, see also Input uncertainty 


integral control, 261 
inverse additive, 314 
inverse multiplicative, 273, 314 
LFT, 305, 310 
limitation MIMO, 249-262 
limitation SISO, 207-208 
lumped, 272, 315 
modelling SISO, 269 
multiplicative, 272, 278, 280 
neglected dynamics, 271, 282 
nominal model, 282 
Nyquist plot, 276, 281 
output, 250, 313, 315 
parametric, 271, 273, 280, 313 
A-matrix, 304 
gain, 274, 301 
gain and delay, 283 
pole, 302 
repeated perturbations, 305 
time constant, 274 
zero, 302 
physical origin, 271 
pole, 281 
RHP-pole, 302 
RHP-zero, 302, 307 
state space, 303 
stochastic, 273 
structured, 273 
time-varying, 365 
unmodelled, 271, 285 
unstable plant, 302 
unstructured, 273, 313 
weight, 279, 280 
Unitary matrix, 537 
Unstable hidden mode, 137 
Unstable mode, 139 
Unstable plant 
frequency response, 20 


Valve position control, 446 
Vector norm, 550 
p-norm, 550 
Euclidean norm, 550 
MATLAB, 556 
max norm, 550 


Waterbed effect, 175 

Weight selection, 61, 358 
Heo loop shaping, 406, 520 
mixed sensitivity, 506 
mixed sensitivity (S/KS), 90 
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performance, 61, 358 
Weighted sensitivity, 61 

generalized plant, 115 

MIMO system, 85 

RHP-zero, 180, 188, 230 

typical specification, 61 
Weighted sensitivity integral, 178 
White noise, 375 
Wiener-Hopf design, 397 


Zero, 141, 141-148 
decoupling zero, 145 
effect of feedback, 146, 147 
from state-space realization, 141 
from transfer function, 142 
input blocking, 145 
invariant zero, 145 
non-square system, 143, 146 
pinned, 146 

Zero direction, 143 

Ziegler-Nichols’ tuning rule, 30 
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